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METHODS AND COMPOUNDS FOR MODULATING NUCLEAR RECEPTOR ACTIVITY 
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INTRODT TCTTfVM 
Technical Field 

The present invention relates to improved methods and compounds for modulating 
nuclear receptor activity. In particular, the present invention relates to improved methods and 
compounds for modulating estrogen receptor activity. 

Background 

Cells contain receptors that can elicit a biological response by binding various molecules 
including proteins, hormones and/or drugs. Nuclear receptors represent a super family of 
proteins that are hormone/ligand-activated transcription factors that enhance or repress 
transcription in a cell type-, ligand- and promoter-dependent manner. A classic nuclear receptor, 
the estrogen receptor a (ERa) is a key factor in regulating the differentiation and maintenance of 
neural, skeletal, cardiovascular and reproductive tissues (Korach, Science 266:1524-1527 
(1994); Smith, et al., Mew Engl- J. Msd, 331 :1056-1061 (1994)). The nuclear receptor family 
also includes receptors for glucocorticoids, androgens, mineralocorticoids, progestins, thyroid 
hormones, vitamin D, retinoids, peroxisome proliferators and eicosanoids. A subset of the 
nuclear receptor family are the steroid receptors, which include the estrogen, glucocorticoid and 
progestin receptors. 

Overall sequence conservation between nuclear receptors varies between different 
families of receptors; however, sequence conservation between functional regions, or modules, 
of the receptors is high. For example, nuclear receptors can be organized into functional 
modules comprising an N-terminal transcriptional activation domain, a central DNA binding 
domain (DBD), and a C-terminal ligand binding domain (LBD). The LBD of nuclear receptors 
represents a hormone/ligand-dependent molecular switch, and recognizes a variety of 
compounds diverse in their size, shape and chemical properties. Accordingly, the estrogen 
hormones exert their physiological effects by binding to the estrogen receptor (Beato, et'al., £ell 
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83(6):85 1-857 (1995); Tsai. et al.. Annu . Rev. Biochem . 63:451-86 (1994)). Binding of a 
hormone to a nuclear receptor's LBD also changes its ability to modulate transcription of DNA, 
although they may have transcription-independent actions- 
All ERcc ligands bind exclusively to the C-terminal LBD. Some of these ligands. 
including the endogenous estrogen. 1 7p-estradiol (E 2 ), and the synthetic nonsteroidal estrogen, 
diethylstilbestrol (DES), function as pure agonists whereas others such as ICI- 164,384 function 
as pure antagonists. Synthetic ligands such as tamoxifen and raloxifene (RAL) belong to a 
growing class of molecules known as selective estrogen receptor modulators (SERMs), which 
function as antagonists in specific tissue and promoter contexts (Grese, et al., Proc. Natl. Acad. 
Sd. USA 94:14105-10 (1997)). The remarkable tissue-specific behavior of tamoxifen was 
recently demonstrated in a breast cancer prevention trial, reported in Smigel, J. Natl. Cancer Inst. 
90:647-8 (1998), where a group of women at high risk for breast cancer who received tamoxifen 
treatment over a six year period, exhibited an increased incidence of endometrial cancer but a 
reduced occurrence of certain bone fractures and a dramatic 45% reduction in breast cancer 
incidence. The rational design of new SERMs and the optimization of existing ones require an 
understanding of the effects of different ligand chemistries and structures upon ERct 

transcriptional activity. 

Nuclear receptors also bind proteins, such as chaperone complexes, corepressors, or 
coactivators, that are involved in receptor function. In particular, ligand-dependent activation of 
transcription by nuclear receptors is mediated by interactions with coactivators. Receptor 
agonists promote coactivator binding and antagonists block coactivator binding. Hormone 
binding by a nuclear receptor can increase or decrease binding affinity to these proteins, and can 
influence or mediate the multiple actions of the nuclear receptors on transcription. 

Transcriptional activation by ERa is mediated by at least two separate activation 
functions (AFs) located within different domains of the protein, AF-1 in the N-terminus, and AF- 
2 in the LBD. These AFs can act independently or cooperatively, depending on the cell type and 
the promoter context. The activity of AF-1 is regulated by growth factors acting through the 
MAP kinase pathway (Kato, et al.. Science 270:1491-1494 (1995)) and is generally believed to 
be activated in a ligand-independent manner, while AF-2 activity ("transcriptional activity") is 
responsive to ligand binding (Kumar, et al.. Cejl 51 (6):94 1-951 (1987)). The binding of agonists 
triggers transcriptional activity whereas the binding of antagonists does not (Berry, et al., EMBO 
J. 9:281 1-8 (1990)). In addition, coactivators mediate transcriptional activity. The structural and 
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functional nature of the site to which coactivators bind has only recently been defined. Apriletti. 
et al., US Provisional No. 60/079,956, filed March 30. 1998, the disclosure of which is 
incorporated herein by reference. 

Recent structural studies suggest that ligands regulate transcriptional activity by directly 
5 affecting the structure of the LBD. Comparison of the structure of the unliganded human 

retinoid X receptor a LBD (Bourguet, et al.. Nature 375:377-82 (1995)) with the structures of 
the liganded LBDs of the human retinoic acid receptor y (RARy) (Renaud, et al.. Nature 
378:681-689 (1995) and Wurtz, et al., Nat. Struct . Biol . 3:87-94(1996)), the thyroid hormone 
receptor a (TRa) (Wagner, et al., Nature 378:690-697 (1995)), the progesterone receptor 
10 (Williams, et al., Nature 393:392-395 (1998)) and the ERa (Brzozowski, et al., Nature 389:753- 
758 (1997); Tanenbaum, et al., Proc . Natl. Acad . Sci. USA 95:5998-6003 (1998)) suggests that 
an agonist-induced conformational change involving the repositioning of helix 12. the most C- 
terminal helix of the LBD, is essential for transcriptional activity. Because certain point 
mutations in helices 3, 5 and 12 abolish transcriptional activity but have no effect on ligand or 

1 5 DNA binding, these regions of the LBD have been predicted to form part of a recognition 

surface, created in the presence of agonist, for molecules that link the receptor to the general 
transcriptional machinery (Danielian, et al., EMBO J. 1 1: 1025-33(1 992); Feng, et al.. Science 
280:1747-9 (1998); Henttu, et al., MoJ. CelL BjoJ. 17:1832-9 (1997); Wrenn, et al., J. Biol . 
Chem. 268:24089-24098 (1993)). The structures of the LBD complexed with E 2 and RAL show 

20 that although both ligands bind at the same site within the core of the LBD (Brzozowski. et al., 
suEra), each of these ligands induces a different conformation of helix 12. Whereas helix 12 in 
the E 2 -LBD complex packs against the helices 3. 5/6 and 1 1 in a conformation that has been 
observed for the corresponding helix in other agonist-bound NR LBD structures, helix 12 in the 
RAL-LBD complex is bound in a hydrophobic groove composed of residues from helices 3 and 

25 5. This alternative orientation of helix 12 partially buries residues in the groove that are 

necessary for transcriptional activity, suggesting that RAL and possibly other antagonists block 
transcriptional activity by disrupting the topography of the coactivator binding site surface. 

Biochemical and genetic approaches have led to the identification of several proteins that 
associate in a ligand-dependent manner with ERa (Horwitz, et al., Mol. Endocrinol 10:1 167- 

30 1 1 77 (1 996)) including SRC-1 /N-CoA 1 (Onate. et al.. Science 270:1 354-1 357 ( 1 995)), 

GRIP1/TIF2/SRC-2 (Hong, et al., Proc . Natl . Acad . Sci . USA 93(I0):4948-4952 (1996) and 
Voegel, et al. EMBO J. 15:3667-3675 (1996)). p/CIP/RAC3/ACTR/AIBl/SRC-3 (Anzick. et al.. 
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Science 277:965-968(1997). Chen, et aL Cell 90(3):569-80 (1997), Li, et al., Proc. Natl. Acad. 
Sci . USA 94:8479-84 (1997) and Torchia. et aL Nature 387:677-684 (1997)) and CBP/p300 
(Hanstein, et ah, Proc . Natl . Acad . Sci. USA 93:11 540- 1 1 545 ( 1 996)). These proteins have been 
classified as transcriptional coactivators because they enhance ligand-dependent transcriptional 
5 activation by ERa as well as by several other NRs (Glass, et al., Curr . Opin. CeU Biol. 9:222-32 
(1997); Torchia, et aL, supra ). The observation of partial hormone resistance in mice with a 
disrupted SRC-1 gene (Xu. et al., Science 279: 1922-1925(1998)) provides compelling evidence 
that coactivators are required for NR function in vivo. Consistent with its proposed role in AF-2 
directed transcriptional activation, SRC-1 possesses histone acetylase activity and the ability to 
10 interact not only with agonist-bound receptors but also with other coactivators and several 
general transcription factors (Kameu et al., Cell 85(3):403-14 (1996); Onate, et al., supra; 
Spencer, et al., Nature 389:194-8 (1997); Takeshita, et al., Endocrinology 137:3594-7 (1996)). 
SRC-1 and GRIP1 also bind to the agonist-bound LBDs of both the human TR(5 and human ERa 
using the putative coactivator binding site (Feng, et al., supra). 
1 5 Members of the pi 60 family of coactivators such as SRC-1 , GRIP 1/TIF2/SRC-2, and 

p/CIP/RAC3/ACTR/AIBl/SRC-3 as well as other coactivators recognize agonist-bound NR 
LBDs through a short signature sequence motif, LXXLL (SEQ ID NO:l) (where L is leucine and 
X is any amino acid), known as the NR box (Ding, et al., Mol. Endocrinol. 1 2:302-3 13 (1 998); 
Heery, et al ^ Nature 387:733-736 (1997); Le Douarin, et aL, EMBO J. 15:6701-15 (1996); 
20 Torchia, et al., supra ). Mutagenesis studies indicate that the affinity of coactivators for NR 
LBDs is determined principally, if not exclusively, by these NR boxes (Ding, et al., supra); 
Heery, et al., Nature 387:733-736 (1997); Le Douarin, et al.. EMBO J. 15:6701-15 (1996); 
Torchia, et aL, supra ). Each of the pi 60 coactivators contains several NR boxes. The NR boxes 
within SRC-1, GRIP1 and TIF2 have been demonstrated to recognize different NRs with 
25 different affinities (Ding, et al., supra ; Kalkhoven, et al., EMBO J. 17:232-43 (1998); Voegel, et 
al., EMBO J. 17:507-19 (1998)), but the reasons for these binding preferences are unknown. 

Darimont, et al., "Structure and specificity of nuclear receptor-coactivator interactions" 
Genes Dev . 12:3343-3356 (1998) describes structural studies of the complex between TR(3 and 
the GRIP1 NR Box II peptide and biochemical studies of GRIP 1 binding to TR(3 and GR. The 
30 PPARy/SRC-1 peptide complex is described in Nolte, et al.. Nature 395:137-143 (1998). 

The medical importance of nuclear receptors is significant. They have been implicated in 
breast cancer, prostate cancer, cardiac arrhythmia, infertility, osteoporosis, hyperthyroidism, 

4 
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hypercholesterolemia, obesity and other conditions. For example, compounds that modulate 
ERot transcriptional activity are currently being used to treat osteoporosis, cardiovascular disease 
and breast cancer (Gradishar. et al., J. Clin . Oncol . 1 5:840-52 (1997) and Jordan. J. Natl . Cancer 
inst. 90:967-71 (1998)). 

5 A need continues to exist for further identification and characterization of the key 

residues within the ligand binding domains of the nuclear receptors, and molecules that affect the 
receptor by binding to these sites. Understanding these interactions provides a basis for iterative 
drug design, synthesis, and selection. It also would be advantageous to devise methods and 
compositions for reducing the time required to discover compounds that target these binding 

10 sites and administer them to organisms to modulate physiological processes regulated by the 
nuclear receptors, and the estrogen receptor in particular. 

SUMMARY OF THE INVENTION 
The present invention relates to the further identification and manipulation of the ligand 
binding domain (LBD) of nuclear receptors, which facilitates the design of compounds that bind 

1 5 to the LBD and modulate nuclear receptor activity, and the estrogen receptor in particular. The 
compounds include agonists and antagonists that modulate nuclear receptor activity, and can be 
receptor-, cell- and/or tissue-specific. In particular, the compounds modulate nuclear receptor 
activity by affecting coactivator-coactivator binding site interactions. 

The present invention also includes protein cocrystals of the nuclear receptors with an 

20 agonist bound to the LBD and a peptide bound to the coactivator binding site and methods for 
making them. Similarly, the invention also includes protein cocrystals of the nuclear receptors 
with an antagonist bound to the LBD and methods for making them. The cocrystals provide 
means to obtain atomic modeling information of the specific amino acids and their atoms 
forming the LBD and coactivator binding sites and that interact with molecules that bind to the 

25 sites. The cocrystals also provide modeling information regarding the ligand:nuclear receptor 
and coactivatonnuclear receptor interactions, as well the structure of ligands bound thereto. 

The present invention further provides methods for identifying and designing molecules 
that modulate ligand binding to a nuclear receptor using atomic models of nuclear receptors. 
The method involves modeling test compounds that fit spacially into a nuclear receptor LBD 

30 using an atomic structural model comprising a nuclear receptor LBD or portion thereof, 

screening the test compounds in an assay, such as a biological assay, characterized by binding of 
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a test compound to the nuclear receptor LBD. and identifying a test compound that modulates 
ligand binding to the receptor. 

The invention also includes compositions and methods for identifying key residues 
within the LBDs of nuclear receptors. The methods involve examining the surface of a nuclear 
5 receptor of interest to identify residues that modulate ligand and/or coacti vator binding. The 
residues can be identified by homology to the key residues within the LBD of human ERa 
described herein. Overlays and superpositioning with a three dimensional model of a nuclear 
receptor's LBD, and/or a portion thereof, also can be used for this purpose. Additionally, 
alignment and/or modeling can be used as a guide for the placement of mutations on the LBD 

10 surface to characterize the nature of the site in the context of a cell. 

Also provided is a method of modulating the activity of a nuclear receptor. The method 
can be in vitro or in vivo. The method comprises administering in vitro or in vivo a sufficient 
amount of a compound that binds to the ligand binding domain and acts either as an agonist or an 
antagonist. Preferred compounds bind to the site with greater affinity than ligands found in a cell 

15 of interest. 

The invention further includes a method for identifying an agonist or antagonist of ligand 
binding to a nuclear receptor. The method comprises providing the atomic coordinates 
comprising a nuclear receptor ligand binding domain or portion thereof to a computerized 
modeling system; modeling compounds which fit spatially into the nuclear receptor ligand 

20 binding domain; and identifying in an assay, for example a biological assay, for nuclear receptor 
activity a compound that increases or decreases activity of the nuclear receptor through binding 
the ligand binding domain. 

Also provided is a machine-readable data storage medium with information for 
constructing and manipulating an atomic model comprising the ligand binding domain or 

25 portions thereof. The medium comprises a data storage material encoded with machine readable 
data which, when using a machine programmed with instructions for using said data, is capable 
of displaying a graphical three-dimensional representation of a molecule or molecular complex 
for a nuclear receptor ligand binding domain. 

Also provided is a method of identifying a compound that selectively modulates the 

30 activity of one type of nuclear receptor compared to other nuclear receptors. The method is 
exemplified by modeling test compounds that fit spatially and preferentially into a nuclear 
receptor ligand binding domain of interest using an atomic structural model of a nuclear receptor 

6 
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LBD. selecting a compound that interacts with one or more residues of the LBD unique in the 
context of that site, and identifying in an assay, for example a biological assay, for ligand 
binding activity a compound that selectively binds to the LBD compared to other nuclear 
receptors. The unique features involved in receptor-selective ligand binding can be identified by 
^ comparing atomic models of different nuclear receptors or isoforms of the same type of receptor. 

The invention finds use in the selection and characterization of peptide, peptidomimetic. 
as well as other compounds, such as small organic molecules, identified by the methods of the " 
invention, particularly new lead compounds useful in treating nuclear receptor-based disorders 
in particular steroid receptor-based disorders, and more specifically estrogen receptor-based ' 
1 0 disorders. 

BRIEF DES CRIPTION OF THE PR A WTNP.S 
Figure 1 provides a stereo view of the electron density of the complexes, where Figure 
1 A is a stereo view of the electron density of the DES-ERcc LBD-GRIP1 NR Box II peptide 
complex and Figure IB is a stereo view of the electron density of the OHT-ERct LBD complex. 
5 Figure 1 is a black and white graphical representation of a figure that was generated using 
BOBSCRIPT (Esnouf, J. Mo\. Oral*. Model. , 5 . 132-4, 1 12-3 (1997)) and rendered using 
Raster3D (Merritt, et al., Acta Crvstallogr . D 50:869-873 (1994)). 

Figure 2 was generated using BOBSCRIPT and rendered using Raster3D as described 
above. Figure 2A shows the overall structure of the DES-ERcc LBD-GRIP] NR Box II peptide 
complex in two orthogonal views. Figure 2B shows the overall structure of the OHT-ERct LBD 
complex in two orthogonal views similar to those of the agonist complex in Figure 2A. 

Figures 3A and 3B were generated using BOBSCRIPT and rendered using Raster3D as 
described above. Figures 3C and 3D were created using GRASP (Nicholls. GRASP Manual 
(New York: Columbia University, 1992)). Figure 3A shows a close-up view of the coactivator 
pept.de bound to the LBD, i.e., the NR Box II peptide/LBD interface. The regions of the LBD 
that do not interact with the peptide have been omitted for clarity. Helices 3. 4 and 5 are labeled 
H3, H4 and H5 respectively. The side chains of receptor residues which interact with the peptide 
are depicted, except for Lys 362 (blue) and Glu 542 (red), the side chains are colored by atom 
type (carbon and sulfur atoms are colored green, oxygen atoms are colored red and nitrogen 
atoms are colored blue). Helix 12 is colored magenta. The peptide, colored gold, is depicted as 
a Coc worm; only the side chains of He 689 and the three motif leucines (Leu 690. Leu 693 and 
Leu 694) are drawn (Figure 3C). Figure 3B shows the helix 12/LBD interface as a close-up view 



20 
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of the OHT-LBD complex showing helix 12 bound to part of the coactivator binding site. Only 
the side chains of residues that interact with helix 12 are drawn (with the exception of side chain 
of His 373 which is omitted for clarity). Except for Lys 362 (blue) and Glu 380 (red), the side 
chains are colored by atom type as specified for Figure 3 A. Residues 530-551 are depicted as a 
Ca worm; residues 536-544 are colored magenta. The side chains of Leu 536, Tyr 537. Leu 
540, Met 543 and Leu 544 are shown. Figure 3C is a molecular surface representation showing 
the electrostatic surface of the ERa LBD bound to the NR Box II peptide as positive (blue) and 
negative (red) regions, as calculated in GRASP. The coactivator peptide is depicted as in Figure 
3 A and the view is equivalent to that in Figure 3 A. The side chains of Leu 690 and Leu 694 are 
bound in a hydrophobic groove and those of He 689 and Leu 693 rest against the edge of this 
groove. Figure 3D shows the electrostatic surface of the ERa LBD complexed with OHT, 
showing positive (blue) and negative (red) regions as in Figure 3C. Residues 530-5 11 are 
depicted as in Figure 3B and the view is equivalent to that in Figure 3B. Whereas the side chains 
of Leu 540 and Leu 544 are embedded in the hydrophobic groove, that of Met 543 lies along the 
edge of this groove. 

Figure 4 was generated using LIGPLOT (Wallace, et al., Protein Eng. 8:127-34 (1995)) 
and provides schematic diagrams illustrating the DES interactions with the LBD (Figure 4A) and 
OHT interactions with the ligand binding pocket (Figure 4B). Residues that interact with the 
ligands are drawn at approximately their true positions. The residues that form van der Waals 
contacts with ligand are depicted as labeled arcs with radial spokes that face towards the ligand 
atoms with which they interact. The residues that hydrogen bond to ligand are shown in ball- 
and-stick representation. Hydrogen bonds are represented as dashed cyan lines and the distance 
of each bond is given. The ligand rings and the individual ligand atoms are labeled. 

Figure 5 was generated using BOBSCRIPT and rendered using Raster3D as described 
above, and shows a comparison of helix 12 from the OHT complex and the NR Box II peptide. 
Figure 5A and Figure 5B are stereo views. The structures of the OHT-LBD complex and the 
DES-LBD-NR Box II peptide complex were overlapped using the Ca coordinates of residues 
306-526 of the LBD. Helix 12 from the DES-LBD-coactivator peptide complex is omitted for 
clarity. Residues 536-551 (helix 12=residues 536-544) from the OHT-LBD complex are colored 
magenta and the peptide is colored gold. The hydrogen bonds between the e-amino group of Lys 
362 and the backbone carbonyls of residues 543 and 544 of helix 12 are illustrated as dashed 
magenta lines. The hydrogen bonds between the E-amino group of Lys 362 and the backbone 

8 
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carbonyls of residues 693 and 696 of the coactivator peptide are depicted as dashed orange lines. 
The following abbreviations are used on helix 12: L540=Leu 540. M543=Met 543. and 
L544=Leu 544. The following abbreviations are used on the peptide: L690=Leu 690, L693=Leu 
693 and L694=Leu 694. 

Figures 6A and 6D were generated using BOBSCRIPT and rendered using Raster3D as 
described above. Figures 6B and 6C were created using MidasPlus (Huang, et al., J. Mol. Graph. 
9:230.6. 242 (1991)). Figure 6A shows that agonists and antagonists promote different LBD 
conformations, as ribbon representations of the DES complex (without the coactivator peptide) 
the OHT complex and the E 2 complex such as is described in Tanenbaum, et al., supra. The 
hormones are shown in space-filling representation. In each complex, helix 12 is colored 
magenta and the main chain of residues 339 to 341, 421 to 423, and 527 to 530 is indicated in 
red. Helices 3, 8 and 1 1 (H3, H8 and HI 1 respectively) are labeled in the DES complex. Figure 
6B shows DES bound in the ligand binding cavity. A cross-section of a space-filling model of 
the LBD bound to DES (green) showing the ligand completely embedded in the ligand binding 
cavity. The A' ring of DES (A'), Phe 404 (404), Met 421 (421) and Phe 425 (425) are labeled 
The carbon atoms of side chain of Met 421 are colored magenta, and the sulfur atom is colored 
yellow. Figure 6C is a cross-section of a space-filling model of OHT (red) bound in the ligand 
binding pocket. The view is equivalent to that in Figure 6B. The B ring of OHT (B) Phe 404 
(404), Met 421 (421) and Phe 425 (425) are labeled. The side chain of Met 421 is colored as in 
Figure 6B. The conformation of the B ring forces Met 42 1 to adopt a different conformation 
than in the one it adopts in the DES complex (compare with Figure 6B). Figure 6D provides a 
comparison of the ligand binding pocket bound to DES (green) and to OHT (red). The structures 
of the OHT complex and the DES complex were overlapped as in Figure 5. The A rings of both 
hgands point out of the page; the B ring of OHT and the A' ring of DES point into the page. The 
LBD bound to OHT is colored blue and the LBD bound to DES is colored light blue-grey. The 
side chains of some of the residues whose conformations are dramatically different between the 
two complexes are drawn; Met 342 (342), Met 343 (343), Phe 404 (404), Met 421 (421) He 424 
(424). Phe 425 (425), His 524 (524), Leu 525 (525), Met 528 (528). The sulfur atom of Met 421 
is colored yellow in both structures. 

Figure 7 illustrates a model of antagonist action. Agonist (white trianele) binding 
stabilizes a conformation of the LBD that promotes coactivator (yellow) binding Residues 5^7- 
530 (red) are pan of helix 1 1 (blue) and the length of the interhelical loop prevents helix 12 
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(magenta) from binding to the static region of the surface involved in transcriptional activity. 
Antagonist (white cross) side chains preclude helix 12 from being positioned over the ligand 
binding pocket. Residues 527-530 (red) adopt an extended conformation as a result of 
antagonist-driven structural perturbations in and around the ligand binding pocket. The length of 
5 the loop between helices 1 1 and 12 allows helix 12 to bind the static region of this surface and 
inhibit coactivator recognition. 

Figure 8 shows alignment of amino acid sequences (single letter amino acid designations) 
containing residues that form the coactivator binding sites of several nuclear receptors: human 
and recombinant thyroid hormones (hTRP and rTRa) (SEQ ID NO:5 and 6 and SEQ ID NO:7 
1 0 and 8), retinoids (hRARy and hRXRct) (SEQ ID NO:9 and 1 0 and SEQ ID NO: 1 1 and 12), 

peroxisome (hPPARy) (SEQ ID NO: 13 and 14), vitamin D (hVDR) (SEQ ID NO: 15 and 16), 
estrogen (hERct) (SEQ ID NO: 17 and 18), glucocorticoid (hGR) (SEQ ID NO: 19 and 20), 
progestin (hPR) (SEQ ID NO:21 and 22), mineralocorticoid (hMR) (SEQ ID NO:23 and 24) and 
androgen (hAR) (SEQ ID NO:25 and 26). The boxes represent residues of alpha-helix (H3. H4, 
15 H5, H6 and H 1 2); lower case letters "h" and "q" represent hydrophobic and polar residues, 

respectively. The numbered positions at the top of the table, 280, 281, etc., correspond to hTRp 
residues. 

DESCRIPTION OF SPECIFIC EMBOD IMENTS 
The present invention provides methods and compositions for identifying compounds 
20 that modulate nuclear receptor activity, in particular steroid receptor activity, and more 
particularly estrogen receptor activity. The compounds are nuclear receptor agonists or 
antagonists that bind to the ligand binding domain. Compounds that bind to the LBD also are 
provided. The compounds can be natural or synthetic. Preferred compounds are small organic 
molecules, peptides and peptidomimetics (e.g., cyclic peptides, peptide analogs, or constrained 
25 peptides). 

Certain residues within the ERa LBD have been identified that are of particular 
importance:Met343. Leu346, Ala350, Glu353. Leu384. Leu387, Leu391, Arg394. Phe404, 
Met421. Leu428, Gly521, His524, Leu525 and Met528 (See Figure 4A). Of these, some have 
been found to directly or indirectly effect the positioning of helix 12: Met343. Met421 . His524, 
30 Leu525 and Met528. Interactions with these particular residues, such as occurs when DES binds 
to the receptor stabilizes a conformation of the LBD that promotes coactivator binding. 
Modifications to a ligand that enhance binding or interaction with these residues would provide 
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for an improved agonist of receptor activity. Similarly, modifications to a ligand that adversely 
affects the binding or interaction with these residues would provide for an improved antagonist. 

In addition, it is believed that the ERct Tyr537 residue plays a role in stabilizing the 
unliganded receptor so that helix 12 is free to interact with the coactivator binding site. The ER 
is quite unique in having a tyrosine at this position as hTRp, rTRa, hRARy, hGR. hPR. hMR and 
hAR all have a proline residue. hRXRa has as aspartic acid residue. hPPARy has a histidine 
residue and hVDR has a threonine residue at positions corresponding to the Tyr 537 residue of 
hERa. Therefore, selective agonists and antagonists can be designed for the estrogen receptor 
that interact with Tyr 537. 

Traditional antagonists such as OHT or RAL have side chains that directly block helix 1 2 
positioning as it is in the agonist complexes (DES or E 2 ). In addition, it is expected that ligand-" 
induced perturbations are required to allow helix 12 to reach into the coactivator helix binding 
site, thereby blocking coactivator binding and inhibiting transcriptional activity. Such 
perturbations or interference with proper interactions with one or more of these LBD residues, 
has been found to relax the receptor, disassembling the receptor's secondary structure and 
resulting in unwinding of helix 1 1 . The unwinding of helix 1 1 increases the length of the loop 
between helices 1 1 and 12, allowing helix 12 to move away from the ligand binding pocket and 
towards the coactivator binding site, where it occludes the coactivator recognition groove by 
mimicking the interactions of the coactivator, and thus inhibits coactivator recognition (see 
Figure 7). Modifications to a ligand that interfere with binding or interaction with one or more 
of the amino acids positions indicated, would cause receptor relaxation, affecting the receptor's 
secondary structure and cause the unwinding of helix 12. Compounds based upon such modified 
ligands would act as antagonists. 

Accordingly, one aspect of the invention is a method of identify ing a compound that 
modulates (i.e., increases or decreases) nuclear receptor activity, comprising: modeling test 
compounds that fit spatially into a nuclear receptor ligand binding domain of interest using an 
atomic structural model of the estrogen receptor a ligand binding domain or portion thereof, 
screening the test compounds in an assay, for example a biological assay, characterized by 
binding of a test compound to the ligand binding domain, and identifying a test compound that 
modulates nuclear receptor activity, wherein the atomic structural model comprises atomic 
coordinates of amino acid residues corresponding to residues of human estrogen receptor a 
Met343, Leu346. Ala350. Glu353, Leu384. Leu387. Leu391. Ar g 394. Phe404. Met421 . Leu4?8 



1 1 



BNSDOCIO: <WO. 



.99S0658A2J_> 



PCT/US99/06937 

WO 99/50658 



Gly521. His524. Leu525 and Met528. preferably Met343. Met421 . His524. Leu525 and Met528. 
In a preferred embodiment, the nuclear receptor is the ER. The test compound can be an agonist 
and nuclear receptor activity is measured by binding of a coactivator or a compound that mimics 
a coactivator, to the coactivator binding site, as defined below. On the other hand, the test 
compound can be an antagonist and nuclear receptor activity is measured by the unwinding of 
helix 12 and/or the blocking of coactivator binding to the coactivator binding site. The screening 
is typically in vitro, and high throughput screening is preferable. Suitable test compounds can be 
designed, as is described later, or can be obtained from a library of compounds, and include, by 
means of illustration and not limitation, small organic molecules, peptides and peptidomimetics. 
The method described above may also include the step of providing the atomic coordinates of the 
estrogen receptor a ligand binding domain or portion thereof to a computerized modeling 

system, prior to said modeling step. 

As used herein, the term "portion thereof is intended to mean the atomic coordinates 
corresponding to a sufficient number of residues or their atoms of the LBD that interact with a 
compound capable of binding to the site. This includes receptor residues having an atom within 
4.5 A of a bound compound or fragment thereof. Thus, for example, the atomic coordinates 
provided to the modeling system can contain atoms of the nuclear receptor LBD, part of the LBD 
such as atoms corresponding to the LBD or a subset of atoms useful in the modeling and design 
of compounds that bind to a LBD. 

The atomic coordinates of a compound that fits into the ligand binding domain also can 
be used for modeling to identify compounds or fragments that bind the site. By "modeling" is 
intended quantitative and qualitative analysis of molecular structure/function based on atomic 
structural information and receptor-ligand agonists/antagonists interaction models. This includes 
conventional numeric-based molecular dynamic and energy minimization models, interactive 
computer graphic models, modified molecular mechanics models, distance geometry and other 
structure-based constraint models. Modeling is preferably performed using a computer and may 
be further optimized using known methods. By "fits spatially" is intended that the three- 
dimensional structure of a compound is accommodated geometrically by a cavity or pocket of a 
nuclear receptor LBD. 

It is expected that targeting the corresponding amino acids on other nuclear receptors will 
have the same effect. Accordingly, one embodiment of the invention pertains to methods of 
designing antagonists that bind the LBD of a nuclear receptor but do not interact with one of 
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more residues within the LBD that correspond to (i.e.. the same as or equivalent to) human ERcc 
residues Met343. Leu346. Ala350, Glu353, Leu384. Leu387. Leu391. Arg394, Phe404. Met421. 
Leu428. Gly521. His524. Leu525 and Met528, preferably Met343. Met421, His524, Leu525 and 
Met528. Similarly, another embodiment of the invention pertains to methods of designing 
agonists that bind the LBD of a nuclear receptor and have enhanced interaction with one or more 
residues within the LBD that correspond to the human ERcc residues Met343, Leu346. Ala350 
G1U353, Leu384. Leu387, Leu391, Arg394, Phe404. Met421, Leu428, Gly521, His524. Leu525 
and Met528, preferably Met343, Met421, His524, Leu525 and Met528. An example of 
enhanced interaction is where the agonist has a greater binding affinity for one or more of said 
res.dues. as compared to an endogenous ligand. Such corresponding positions for other 
members of the nuclear receptor family are shown in Table I. which provides an alignment of 
ammo acid sequences (one letter amino acid designations ) containing residues from the ligand 
b.nding domains of several nuclear receptors that correspond to the designated positions on the 
human estrogen receptor (hERa): recombinant thyroid hormone (rTRa), retinoids (hRARy and 
hRXRa), glucocorticoid (hGRa), progestin (hPR), mineralocorticoid (hMR) and androgen 
(hARct). It is understood that Table 1 is merely illustrative of the invention and is not intended 
to be limiting in any manner. Accordingly, it is understood that corresponding amino acid 
residues of other nuclear receptors such as other estrogen receptors, thyroid receptors, retinoid 
receptors, glucocorticoid receptors, progestin receptors, mineralocorticoid receptors, androgen 
receptors, peroxisome receptors and vitamin D receptors, may also be used in the methods of the 
invention. 



Table 1 

hERa M343 M421 F425 H524 

hPR L715 F794 1798 Y890 

hARa L701 M780 Q784 F876 

hGRa M560 M639 Q643 Y735 

hMR L766 M845 L849 F941 

hRARy W227 F304 L308 R396 

hRXRa V265 G343 F346 H435 

rTRa F215 L292 V296 R384 



L525 
C891 
T877 
C736 
C942 
A397 
L436 
F385 



M528 
T894 
L881 
T740 
T946 
L401 
F440 
M389 



The term "coactivator binding site" is used herein to mean a structural segment or 
segments of the nuclear receptor polypeptide chain folded in such a way so as to give the proper 
geometry and amino acid residue conformation for binding coactivator. This is the physical 
arrangement of protein atoms in three-dimensional space forming a coactivator binding site 
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pocket or cavity. As described by Apriletti. et al. ? supra , residues forming the coactivator 
binding site on nuclear receptors are amino acids that correspond to (i.e.. the same as or 
equivalent to) human TR residues of C-terminal helix 3 (Ile280. Thr281 . Val283. Val284, 
Ala287. and Lys288), helix 4 (Phe293). helix 5 (Gln301, Ile302. Leu305. Lys306). helix 6 
(Cys309). and helix 12 (Pro453. Leu454. Glu457, Val458 and Phe459). as shown in Figure 8. 
The coactivator binding site is highly conserved among the nuclear receptor super family. Thus, 
this site corresponds to a surprisingly small cluster of residues on the surface of the LBD that 
form a prominent hydrophobic cleft. The hydrophobic cleft is formed by hydrophobic residues 
corresponding to human TR residues of C-terminal helix 3 (Ile280, Val283. Val284, and 
Ala287), helix 4 (Phe293), helix 5 (lle302 and Leu305), helix 6 (Cys309-), and helix 12 (Leu454, 
Val458 and Phe459). This hydrophobic cleft of the coactivator binding site is also highly 
conserved among the nuclear receptor super family. 

Based upon the Examples set forth herein, residues forming the coactivator binding site 
on the estrogen receptor were found to correspond to those positions described above for the 
human TR. Accordingly, the residues forming the coactivator binding site on ERa are the 
human ERa residues of C-terminal helix 3 (Leu354, Val355, Met357, Ile358, Ala361, and 
Lys362), helix 4 (Phe367), helix 5 (Gln375, Val376, Leu379, Glu380), helix 6 (Trp383), and 
helix 12 (Asp538, Leu539, Glu542, Met543 and Leu544), as shown in Figure 8. As noted above 
for the nuclear receptor family in general, this site corresponds to residues on the surface of the 
LBD that form a prominent hydrophobic cleft, formed by hydrophobic residues corresponding to 
human ERa residues of C-terminal helix 3 (Leu354, Met357, lle358 and Ala361), helix 4 
(Phe367), helix 5 (Val376, Leu379). helix 6 (Trp383), and helix 12 (Leu539. Met543 and 
Leu544). This corroborates the data presented by Apriletti, et al., supra, for the nuclear receptor 
family. 

Structural analysis has revealed the mechanisms by which tamoxifen and other SERMs 
bind the ligand binding domain and block coactivator binding and hence transcriptional activity. 
By this, an understanding of ligand and coactivator binding has also been achieved. Therefore, 
the coactivator binding site residues described above are useful in designing coactivator mimics 
that have broad application in the methods of the instant invention. Such "coactivator mimics" 
are peptides or polypeptides that mimic the coactivator binding site recognition area on the 
surface of a coactivator such that a "coactivator mimic" acts as a competitive inhibitor of 
coactivator binding to the coactivator binding site. Coactivator mimics can be used in an assay 
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to determine receptor activity and hence the agonist or antagonist nature of a test compound, in 
that an agonist will permit a coactivator mimic to bind to the coactivator binding site, while an 
antagonist will prevent such binding. In addition, such coactivator mimics may have therapeutic 
utility when administered in combination with an agonist compound of the invention. 

Another embodiment of the invention pertains to a method of identifying a compound 
that modulates ligand binding to a nuclear receptor, typically by binding to the ligand binding 
domain. This method comprises the steps of modeling test compounds that fit spatially into a 
nuclear receptor ligand binding domain of interest using an atomic structural model of the 
estrogen receptor a ligand binding domain or portion thereof, screening the test compounds in an 
assay characterized by binding of a test compound to the binding domain, and identifying a test 
compound that modulates ligand binding to said nuclear receptor, wherein said atomic structural 
model comprises atomic coordinates of amino acid residues corresponding to residues of human 
estrogen receptor a Met343, Leu346, Ala350, Glu353. Leu384, Leu387, Leu391, Arg394, 
Phe404, Met421, Leu428, Gly521, His524, Leu525 and Met528, preferably Met343, Met421, 
His524, Leu525 and Met528. In the preferred method, the nuclear receptor is ER. TR, GR or 
PR. The screening is typically in vitro such as by high throughput screening. Suitable test 
compounds can be designed or obtained from a library of compounds, and include, by means of 
illustration and not limitation, small organic molecules, peptides and peptidomimetics. The test 
compounds can be either agonists or antagonists of ligand binding. 

The invention also includes compositions and methods for identifying key residues 
within the ligand binding domains of nuclear receptors. The methods involve examining the 
surface of a nuclear receptor of interest to identify residues that modulate ligand and/or 
coactivator binding. The residues can be identified by homology to the key residues on the LBD 
of human ERct described herein. A preferred method is alignment with the residues of any 
nuclear receptor corresponding to (i.e., equivalent to) human ERct residues of Met343. Leu346. 
Ala350, Glu353, Leu384, Leu387, Leu391, Arg394. Phe404, Met421, Leu428. Gly521. His524, 
Leu525 and Met528, preferably Met343, Met421. His524. Leu525 and Met528. Overlays and ' 
superpositioning with a three-dimensional model of a nuclear receptor LBD, or a portion thereof 
that contains these or corresponding residues, also can be used for this purpose. For example, 
three-dimensional structures of TR, GR and PR LBDs can be used for this purpose. For 
example, nuclear receptors identifiable by homology alignment include normal nuclear receptors 
or proteins structurally related to nuclear receptors found in humans, natural mutants of nuclear 
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receptors found in humans, normal or mutant receptors found in animals, as well as non- 
mammalian organisms such as pests or infectious organisms, or viruses. 

Alignment and/or modeling also can be used as a guide for the placement of mutations on 
the LBD surface to characterize the nature of the site in the context of a cell. Selected residues 
are mutated to preserve global receptor structure and solubility in the case of an agonist, or to 
disassemble such structure and permit helix 12 to unwind, as is the case with an antagonist. 
Mutants can be tested for ligand binding as well as the relative change in strength of the binding 
interaction. Ligand-dependent coactivator interaction assays also can be tested for this purpose. 

such as those described herein. 

In particular, the present invention relates to the structural and functional effects on the 
estrogen receptor's LBD. of the binding of two chemically-related compounds, the agonist, 
diethylstilbestrol (DES), and the selective antagonist 4-hydroxytamoxifen (OHT), the active 
metabolite of tamoxifen. As described in the Examples, mutagenesis and binding studies, 
coupled with analysis of atomic models derived from cocrystals, reveals the structure of the 
human estrogen receptor a ligand binding domain (ERct LBD) co-crystallized with a peptide 
molecule comprising a GRIP1 NR Box II peptide sequence (SEQ ID NO:4) (i.e., a peptide 
derived from the NR Box II region of the pi 60 coactivator GRIP 1) bound to the coactivator 
binding site and the agonist, DES. Also revealed is the structure of the ERct LBD co-crystallized 
with the antagonist, OHT. The Examples provide the 2. 03 A resolution crystal structure of the 
hERa LBD bound to DES and the coactivator and the 1 .9A x-ray crystal structure of the hERct 
LBD bound to OHT. i.e.. the crystals defract with at least 2.03A or 1 .9A resolution, respectively. 

In yet another aspect of the invention, compounds of interest are discovered, i.e., agonists 
or antagonists of ligand binding are identified, by a method for identifying an agonist or 
antagonist of ligand binding to a nuclear receptor. The method comprises the steps of providing 
the atomic coordinates of the ERa LBD or portion thereof to a computerized modeling system, 
modeling compounds which fit spatially into the LBD, and identifying in an assay for nuclear 
receptor activity a compound which increases or decreases the activity of the nuclear receptor by 
binding the LBD of the nuclear receptor. Preferably, the atomic coordinates are of the amino 
acid residues corresponding to residues of human estrogen receptor a Met343. Leu346, Ala350. 
Glu353, Leu384. Leu387. Leu391. Arg394. Phe404. Met421. Leu428, Gly521. His524, Leu525 
and Met528, preferably Met343. Met421. His524. Leu525 and Met528. 
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Compounds of particular interest fit spatially and preferentially into the ligand binding 
domain. By "its spatially and preferentially" is intended that a compound possesses a three- 
dimensional structure and conformation for selectively interacting with a nuclear receptor LBD. 
Compounds that fit spatially and preferentially into the LBD interact with amino acid residues 
5 forming the hydrophobic cleft of this site. The present invention also includes a method for 

identifying a compound capable of selectively modulating nuclear receptor activity. The method 
comprises the steps of modeling test compounds that fit spatially and preferentially into the LBD 
of a nuclear receptor of interest using an atomic structural model of a nuclear receptor, screening 
the test compounds in an assay for nuclear receptor activity characterized by preferential binding 

10 of a test compound to the LBD of a nuclear receptor, and identifying a test compound that 

selectively modulates the activity of a nuclear receptor. Such receptor-specific compounds are 
selected that exploit differences between the LBDs of one type of nuclear receptor versus a 
second type of nuclear receptor. 

The invention also is applicable to generating new compounds that distinguish nuclear 

1 5 receptor isoforms. This can facilitate generation of either tissue-specific or function-specific 
compounds. For instance, GR subfamily members have usually one receptor encoded by a 
single gene, although there are exceptions. For example, there are two PR isoforms, A and B, 
translated from the same mRNA by alternate initiation from different AUG codons. There are 
two GR forms, one of which does not bind ligand. This method is especially applicable to the 

20 ER subfamily which usually has several receptors that are encoded by at least two (ER: a; p) 
genes or have alternate RNA splicing. 

The receptor-specific compounds of the invention preferably interact with 
conformationally constrained residues of the LBD that are conserved among one type of nuclear 
receptor compared to a second type of nuclear receptor. "Conformationally constrained" is 

25 intended to refer to the three-dimensional structure of a chemical or moiety thereof having 
certain rotations about its bonds fixed by various local geometric and physical-chemical 
constraints. Conformationally constrained structural features of a LBD include residues that 
have their natural flexible conformations fixed by various geometric and physical-chemical 
constraints, such as local backbone, local side chain, and topological constraints. These types of 

30 constraints are exploited to restrict positioning of atoms involved in receptor-coactivator 
recognition and binding. 
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The present invention also provides for a computational method using three dimensional 
models of nuclear receptors based on crystals of nuclear receptors. Generally, the computational 
method of designing a nuclear receptor ligand determines which amino acid or amino acids of a 
nuclear receptor LBD interact with a chemical moiety (at least one) of the ligand using a three 
dimensional model of a crystallized protein comprising a nuclear receptor LBD with a bound 
ligand, and selecting a chemical modification (at least one) of the chemical moiety to produce a 
second chemical moiety with a structure that either decreases or increases an interaction between 
the interacting amino acid and the second chemical moiety compared to the interaction between 
the interacting amino acid and the chemical moiety. In the instant invention, crystal structures of 
the hERct with DES/peptide and with OHT, have shown that amino acid residues that correspond 
to hERct Met343. Leu346, AIa350 ? Glu353, Leu384. Leu387, Leu391, Arg394, Phe404 ? Met421, 
Leu428, GIy521. His524. Leu525 and Met528, preferably Met343, Met42 1 , His524. Leu525 
and/or Met528, interact with at least one chemical moiety on the ligand. 

Accordingly, one embodiment of the invention is a computational method of designing a 
nuclear receptor ligand where at least one amino acid residue of a nuclear receptor LBD that 
corresponds to human estrogen receptor a Met343, Leu346, Ala350, Glu353, Leu384, Leu387, 
Leu391, Arg394, Phe404, Met421, Leu428, Gly52L His524, Leu525 and Met528, preferably 
Met343, Met42L His524, Leu525 and Met528, interacts with at least one first chemical moiety 
of the ligand, comprising the step of selecting at least one chemical modification of the first 
chemical moiety to produce a second chemical moiety with a structure to either decrease or 
increase an interaction between the interacting amino acid and the second chemical moiety as 
compared to the interaction between the interacting amino acid and the first chemical moiety. 

This computational method may further comprise determining a change in interaction 
between the interacting amino acid and the ligand after chemical modification of the first 
chemical moiety. The chemical modification can either enhance or reduce hydrogen bonding 
interaction, charge interaction, hydrophobic interaction. Van Der Waals interaction or dipole 
interaction between the second chemical moiety and the interacting amino acid as compared to 
the interaction between the first chemical moiety and the interacting amino acid. 

Chemical modifications will often enhance or reduce interactions an atom of a LBD 
amino acid and an atom of an LBD ligand. Steric hindrance will be a common means of 
changing the interaction of the LBD binding cavity with the activation domain. Chemical 
modifications are preferably introduced at C-H, C- and C-OH position in ligands. where the 
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carbon is part of the ligand structure which remains the same after modification is complete. In 
the case of C-H, C could have 1 . 2 or 3 hydrogens, but usually only one hydrogen will be 
replace. The H or OH are removed after modification is complete and replaced with the desired 
chemical moiety. 

5 Such chemical modifications would preferably involve the addition of substituents. onto 

any of the free carbons of the A* ring of DES positioning these substituents to collide or bind 
preferentially with one or more residues that correspond to hERa Met343. Leu346, Ala350, 
Glu353, Leu384, Leu387, Leu391 , Arg394, Phe404, Met42K Leu428, Gly52L His524, Leu525 

and Met528, preferably Met343. Met421, His524, Leu525 or Met528. Typical substituents are 

i 

1 0 hydrophobic groups, including by way of example and not limitation, alky 1 groups such as ethyl, 
propyl, isopropyl, etc., and aromatic groups such as benzyl, etc. 

In practice, one would start with a known ligand for the nuclear receptor of interest as the 
chemical backbone, upon which to base agonist/antagonist design. The known ligand would be 
modified as described above. For example 1 7(5-estradiol is an endogenous ligand for the hERa. 

15 In the case of estradiol, positions of interest are C6a, C7a, C 1 2a, C 1 5a, C 1 6a and C 1 7a. 

Modifications at one or more of these free carbons on 17p-estradiol's backbone would affect the 
ligand's interactions with one or more of the Met343, Leu346, AIa350, Glu353 ? Leu384, Leu387, 
Leu39K Arg394, Phe404, Met42 1, Leu428, Gly521, His524, Leu525 and Met528, preferably 
Met343, Met421, His524, Leu525 and Met528 residues, either providing for enhancing 

20 interaction, which would be the basis for agonist design, or reduced interaction, which would be 
the basis for antagonist design. 

Other chemical backbones of other known Iigands could be used in a similar manner. 
For example, other known agonists include diethylstilbestrol (synthetic), moxestrol (synthetic), 
mesohexestrol (synthetic), coumestrol (clover), A 9 -THC (cannabis), o,p-DDT (insecticide), 

25 zearalenone (fungal) and kepone (insecticide). Known estrogen receptor antagonists include the 
ICI series of modified steroids such as ICI 164,384 and EM800. Known SERM's include 
tamoxifen, raloxifene and GW5638. 

Alternatively known agonists could be positioned in the ligand binding pocket through 
computational or manual docking. Positions for substitution would then be selected based on the 

30 predicted binding orientation of these compounds. In addition, hybrid molecules could be 
generated that also possessed side chains that prevented helix 12 from adopting the agonist- 
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bound position. Novel SERMs can be produced by varying the strength of two different effects: 
the helix 12 displacement and the secondary structure disorganization. 

Previous efforts to make antagonists have involved attachment of large bulky substituents 
to agonists, typically through trial and error, and the drug design methods described herein 
5 provide an alternative strategy of ligand design that may be critical for developing new potential 
antagonists. 

For modeling, docking algorithms and computer programs that employ them can be used 
to identify compounds that fit into the ligand binding domain. For example, docking programs 
can be used to predict how a molecule of interest can interact with the nuclear receptor LBD. 

10 Fragment-based docking also can be used in building molecules de novo inside the LBD, by 
placing chemical fragments that complement the site to optimize intermolecular interactions. 
The techniques can be used to optimize the geometry of the binding interactions. This design 
approach has been made possible by identification of the LBD structure thus, the principles of 
molecular recognition can now be used to design a compound which is complementary to the 

15 structure of this site. Compounds fitting the LBD serve as a starting point for an iterative design, 
synthesis and test cycle in which new compounds are selected and optimized for desired 
properties including affinity, efficacy, and selectivity. For example, the compounds can be 
subjected to addition modification, such as replacement and/or addition of R-group substituents 
of a core structure identified for a particular class of binding compounds, modeling and/or 

20 activity screening if desired, and then subjected to additional rounds of testing. 

Computationally small molecule databases can be screened for chemical entities or 
compounds that can bind in whole, or in part, to a nuclear receptor ligand binding domain of 
interest. In this screening, the quality of fit of such entities or compounds to the binding site may 
be judged either by shape complementarity (DesJalais, et al., J. Med . Chem . 3 1 : 722-729 (1988)) 

25 or by estimated interaction energy (Meng, et al., J. Como . Chem . 13:505-524 (1992)). The 
molecule databases include any virtual or physical database, such as electronic and physical 
compound library databases, and are preferably used in developing compounds that modulate 
coactivator binding. 

Compounds can be designed intelligently by exploiting available structural and 
30 functional information by gaining an understanding of the quantitative structure-activity 

relationship (QSAR), using that understanding to design new compound libraries, particularly 
focused libraries having chemical diversity of one or more particular groups of a core structure. 
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and incorporating any structural data into that iterative design process. For example, one skilled 
in the art may use one of several methods to screen chemical entities or fragments for their 
ability to associate with the ligand binding domain of a nuclear receptor of interest. This process 
may begin by visual inspection of. for example, the LBD on the computer screen. Selected 
fragments or chemical entities may then be positioned into all or part of the site. Docking may 
be accomplished using software such as Quanta and Sybyl, followed by energy minimization and 
molecular dynamics with standard molecular mechanics force-fields, such as CHARMM and 
AMBER. 

Specialized computer programs may also assist in the process of selecting chemical entity 
fragments or whole compounds. These include: GRID (Goodford, J. Med. Chem . 28:849-857 
(1985), available from Oxford University, Oxford, UK); MCSS (Miranker. et al., "Proteins: 
Structure, Function and Genetics" 1 1:29-34 (1991), available from Molecular Simulations, 
Burlington. MA); AUTODOCK (Goodsell, et al., "Proteins: Structure. Function and Genetics" 
8:195-202 (1990), available from Scripps Research Institute, La Jolla, CA); and DOCK (Kuntz, 
et al, J. Mol. Biol. 161 :269-288 (1982), available from University of California, San Francisco ' 
CA), 

Additional commercially available computer databases for small molecular compounds 
include Cambridge Structural Database and Fine Chemical Database (Rusinko, Chem . Peg. 
Auto . News 8:44-47 (1993)). 

Once suitable chemical entities or fragments have been selected, they can be assembled 
into a single compound. Assembly may be proceeded by visual inspection of the relationship of 
the fragments to each other on the three-dimensional image displayed on a computer screen in 
relation to the structure coordinates of a nuclear receptor. This can be followed by manual 
model building using software such as Quanta or Sybyl. 

Useful programs to aid one of skill in the art in connecting the individual chemical 
entities or fragments include: CAVEAT (Bartlett, et al., "CAVEAT: A Program to Facilitate the 
Structure-Derived Design of Biologically Active Molecules", in Molecular Recognition in 
Chemical and Biological Problems, Special Pub., Royal Chem. Soc. 78:182-196 (1989), 
available from the University of California. Berkeley, CA); 3D Database systems such as 
MACCS-3D (MDL Information Systems. San Leandro. CA. reviewed in Martin, J. Med. Chem . 
35:2145-2154 (1992)); and HOOK (available from Molecular Simulations. Burlington. MA). 
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In addition to building a compound in a step-wise fashion, one fragment or chemical 
entity at a time as described above, compounds that bind to a ligand binding domain of interest 
also may be designed as a whole or de novo using either an empty LBD or optionally including 
some portion(s) of a molecule known to bind to the site, such as a known ligand. These methods 
5 include: LUDI (Bohm, J. Comp . Aid . Molec . Design 6:61-78 (1992), available from Biosm 
Technologies, San Diego, CA); LEGEND (Nishibata. et al., Tetrahedron 47:8985 (1991), 
available from Molecular Simulations. Burlington. MA); and LeapFrog (available from Tripos 

Associates. St. Louis, MO). 

Other molecular modeling techniques may also be employed in accordance with this 

10 invention. See, for example. Cohen, et al., J. Med. Chem. 33:883-894 (1990) and Navia, et al., 
Current Opinions in Structural Biology 2:202-210 (1992). For example, where the structures of 
test compounds are known, a model of the test compound may be superimposed over the model 
of the structure of the invention. Numerous methods and techniques are known in the art for 
performing this step, any of which may be used. See, for example, Farmer, "Drug Design" 

15 10:1 19-143, Ariens, ed.. Academic Press. New York (1980); U.S. Patent No. 5,331,573; U.S. 
Patent No. 5,500,807; Verlinde, Structure 2:577-587 (1994); and Kuntz, et al., Science 
257:1078-1082 (1992). The model building techniques and computer evaluation systems 
described herein are not a limitation on the present invention. 

Using these computer modeling systems a large number of compounds may be quickly 

20 and easily examined and expensive and lengthy biochemical testing avoided. Moreover, the 
need for actual synthesis of many compounds can be substantially reduced and/or effectively 
eliminated. 

Compounds identified through modeling can be screened in assays such as are well 
known in the art. Such assays, which include biological assays, are characterized by binding of 

25 the compound to a ligand binding domain of interest for ligand binding activity. Screening can 
be, for example, in vitro, in cell culture, and/or in vivo. Biological screening preferably centers 
on activity-based response models, binding assays (which measure how well a compound binds 
to the receptor), and bacterial, yeast and animal cell lines (which measure the biological effect of 
a compound in a cell). The assays can be automated for high capacity-high throughput screening 

30 (HTS) in which large numbers of compounds can be tested to identify compounds with the 
desired activity. 
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As an example, in vitro binding assays can be performed in which compounds are tested 
for their ability to block the binding of a ligand. fragment, fusion or peptide thereof, to a liaand 
binding domain of interest. For cell and tissue culture assays, they may be performed to assess a 
compound's ability to block function of cellular coactivators. such as members of the p 1 60 
family of coactivator proteins, such as SRC-1. AIB1. RAC3. p/CIP. and GRIP1 and its 
homologues TIF 2 and NcoA-2, and those that exhibit receptor and/or isoform-specific binding 
affinity. In a preferred embodiment, compounds of the invention bind to a ligand binding 
domain with greater affinity than the endogenous ligands. Tissue profiling and appropriate 
animal models also can be used to select compounds. Different cell types and tissues also can be 
used for these biological screening assays. Suitable assays for such screening are described 
herein and in Shibata. et al.. Recent Prog. Horm . Res. 52:141-164 (1997); Tagami. et al., Mol. 
Cell. BioJ. 17(5):2642-2648 (1997); Zhu, et al., J. Biol. Chem . 272(14):9048-9054 (1997); Lin. 
etal.,MoI. Cell. Biol. 1 7(10):61 3 1 -6 138 (1997); Kakizawa, et al., J. Biol. Chem . 272(38):23799- 
23804 (1997); and Chang, et al., Proc . Natl. Acad . Sci. USA 94(17):9040-9045 (1997), which 
references are incorporated herein in their entirety by reference. 

The compounds selected can have agonist and/or antagonistic properties. The 
compounds also include those that exhibit new properties with varying mixtures of agonist and 
antagonist activities, depending on the effects of altering ligand binding in the context of 
different activities of nuclear receptors, either hormone-dependent or hormone-independent, 
which are mediated by proteins other than coactivators. and which interact with the receptors at 
locations other than the coactivator binding site. The compounds also include those, which 
through their binding to receptor locations that are conformationally sensitive to hormone 
binding, have allosteric effects on the receptor by stabilizing or destabilizing the hormone-bound 
conformation of the receptor, or by directly inducing the same, similar, or different 
conformational changes induced in the receptor by the binding of hormone. 

Of particular interest is use of such compounds in a method of modulating nuclear 
receptor activity in a mammal by administering to a mammal in need thereof a sufficient amount 
of a compound that fits spatially and preferentially into a ligand binding domain of a nuclear 
receptor of interest. By "modulating" is intended increasing or decreasing activity of a nuclear 
receptor. For example, pre-clinical candidate compounds can be tested in appropriate animal 
models in order to measure efficacy, absorption, pharmacokinetics and toxicity following 
standard techniques known in the art. Compounds exhibiting desired properties are then tested ' 
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in clinical trials for use in treatment of various nuclear receptor-based disorders. These include 
ER-based disorders, such as postmenopausal symptoms and cancer resulting from loss of 
estrogen production, and osteoporosis and cardiovascular disease stemming from traditional 
estrogen replacement therapy. Others include GR-based disorders including Type II diabetes 

5 and inflammatory conditions such as rheumatic diseases. 

Although for many nuclear receptors, the goal is to discover novel synthetic agonists or 
antagonists, it is important to realize the value for some nuclear receptors, especially the estrogen 
receptor, of developing compounds that have the desired agonist and antagonist effects in target 
tissues. Such compounds can be discovered and/or designed by the methods described herein, 

10 then screened for tissue specificity by methods that are well known in the art. For example, there 
is a great need for the improvement of existing therapies and the development of new agonists 
that act like estrogen in cardiovascular and brain tissue and bone, and new antagonists that act 
upon the estrogen receptor in uterine and breast tissue. Ideally, a compound will have more than 
one of these traits, i.e., a compound will act as an agonist in one tissue, while acting as an 

15 antagonist in another tissue. While the tissue-selective antagonism of SERMs such as OHT and 
RAL is the result of numerous factors (Grainger, et al., Nature Medicine 2(4):38 1-385 (1996); 
Grese, et al M supra ; and Jordan, J. Natl . Cancer Inst . 90:967-71 (1998)), dissection of the 
mechanisms of action of these ligands requires a comprehensive understanding of how they act 
on the LBD and regulate its interactions with other cellular factors. The instant invention shows, 

20 unexpectedly, that ligand-mediated structural perturbations in and around the ligand binding 
pocket, and not simply side chain effects, contribute to receptor antagonism. Accordingly, by 
adjusting the balance between these two effects provides a novel strategy for the design of 
improved SERMs. 

With this knowledge, it is of particular interest to design therapeutic compounds that will 
25 distort at least one amino acid residue corresponding to residues of human estrogen receptor a 

Met343 ? Leu346, Ala350, Glu353. Leu384, Leu387, Leu391, Arg394. Phe404, Met421, Leu428, 
Gly521. His524, Leu525 and Met528, preferably Met343, Met42L His524. Leu525 and Met528. 
Accordingly, one aspect of the invention is a method of modulating nuclear receptor activity in a 
mammal by administering to a mammal in need thereof a sufficient amount of a ligand that fits 
30 spatially and preferentially into a ligand binding domain of a nuclear receptor of interest, 

wherein the ligand is designed by a computational method where at least one amino acid residue 
of a nuclear receptor LBD that corresponds to hERot Met343, Leu346. Ala350. Glu353, Leu384 ; 
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Leu387. Leu39l . Arg394. Phe404. Met421. Leu428. GW521. His524, Leu525 and Met5?8 
preferably Met343. Met42 1 . His524. Leu525 and Met528. interacts with at least one first * 
chemical moiety of the ligand. Such a method involves selecting at least one chemical 
modification of the first chemical moiety to produce a second chemical moiety with a structure 
that cither decreases or increases an interaction between the interacting amino acid and the 
second chemical moiety as compared to the interaction between the interacting amino acid and 
the first chemical moiety. 

Compounds designed by this method can be either agonists or antagonists and the method 
of modulating nuclear receptor activity can comprise administering an antagonist alone an 
agonist alone or an agonist in combination with a coactivator or a compound that mimics a 
coactivator by binding to the coactivator binding site. 

The coactivator can be a known coactivator. The coactivator mimic can be desiened by a 
computational method where at least one amino acid residue of a nuclear receptor coactivator 
binding site that corresponds to hERcc helix 3 residues Leu354, Val355, Met357 Ile358 Ala361 
and Lys362, helix 4 residue Phe367, helix 5 residues Gln375, VaI376, Leu379 and Glu380 helix 
6 residue Trp383. and helix 12 residues Asp538, Leu539. Glu542, Met543 and Leu544 interacts 
with at least one first chemical moiety of the coactivator mimic. The method involves selecting 
at least one chemical modification of the first chemical moiety to produce a second chemical 
moiety with a structure that either decreases or increases an interaction between the interacting 
amino acid and the second chemical moiety as compared to the interaction between the 
interacting amino acid and the first chemical moiety. 

Use of an agonist in combination with a coactivator or coactivator mimic also provides a 
unique strategy for delivering therapeutics that have novel tissue-specific effects. For example 
coactivator mimics can be designed to bind into the site involved in transcriptional activity only 
when hel,x-12 is in its agonist bound state. If such coactivator mimics are specific for this site of 
a particular receptor, it is possible to selectively inhibit that receptor only in the presence of 
agonist. This could lead to novel, tissue specific antagonism based on the levels of endogenous 
agonists. Agonists designed by the methods of the instant invention could be used in assay to 
determine the specificity of coactivator mimics. Alternatively, the effective levels in a given 
tissue could be modulated by giving known antagonists or antagonists designed bv the methods 
of the instant invention. The crystal structure of the LBD/DES/GRIP1 peptide complex 
described herein, precisely defines the binding site that would need to be targeted 
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As noted above and as exemplified in the Examples. ER LBDs are co-crystallized with a 
peptide molecule comprising a coactivator GRIP1 NR Box II peptide sequence (SEQ ID NO:4) 
bound to the coactivator binding site and DES with the cocrystal structure refined to a resolution 
of 2.03 A and co-crystallized with OHT with the cocrystal structure refined to a resolution of 
1.9A. Accordingly, the invention also provides for cocrystals made from nuclear receptor ligand 
binding domains with a ligand bound to the ligand binding domain and a molecule bound to the 
coactivator binding site. Preferably the cocrystal structure is refined to a resolution greater than 
3.6A. i.e.. having a resolution value less than 3.6A. More preferably the cocrystal structure is 
refined to greater than 3.4A, 3.2A. 3.0A. 2.8 A, 2.6A, 2.4A, 2.2A, even more preferably to a 
resolution greater than 2.03A. The invention further provides for cocrystals made from nuclear 
receptor ligand binding domains with a ligand bound to the ligand binding domain. Preferably 
the cocrystal structure is refined to a resolution greater than 3. 6 A. i.e., having a resolution value 
less than 3. 6 A. More preferably the cocrystal structure is refined to greater than 3. 4 A, 3. 2 A, 
3.0A, 2.8A, 2.6A, 2.4A, 2.2A, 2.0A, even more preferably to a resolution greater than 1.9A. 

Crystals are made from purified nuclear receptor LBDs that are usually expressed by a 
cell culture, such as E. coli. E. coli is often a preferred expression system. The thyroid receptor 
was successfully expressed in E. coli in Apriletti, et al., supra. However, it has long been 
believed that a human heat shock protein was required for successful recombinant expression of 
the estrogen receptor. Therefore, it was quite unexpected to find that the estrogen receptor could 
be expressed as an active protein in E. coli. 

Preferably, different crystals (cocrystals) for the same nuclear receptor are separately 
made using different coactivators-type molecules, such as protein fragments, fusions or small 
peptides. The coactivator-type molecules preferably contain NR-box sequences necessary for 
binding to the coactivator binding site, or derivatives of NR-box sequences. Other molecules 
can be used in co-crystallization, such as small organics that bind to the coactivator or hormone 
binding site(s). Heavy atom substitutions can be included in the LBD and/or a co-crystallizing 
molecule. 

After the three dimensional structure of the cocrystal is determined, the structural 
information can be used in computational methods to design synthetic compounds for the nuclear 
receptor, and further structure-activity relationships can be determined through routine testing 
using the assays described herein and known in the art. 
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Since nuclear receptor LBDs may crystallize in more than one crystal form the structure 
coordinates of such receptors or portions thereof, as provided in Appendices 1 and ? are 
particularly useful to solve the structure of those other crystal forms of nuclear receptors Thev 
may also be used to solve the structure of mutants or co-complexes of nuclear receptors havin* 
sufficient homology. 

One method that may be employed for this purpose is molecular replacement In this 
method, the unknown crystal structure, may be determined using the structure coordinates of this 
invention as provided in Appendices 1 and 2. The Appendix 1 coordinates for the DES-ERct 
LBD-GRIP1 NR Box II peptide complex and for the Appendix 2 coordinates for the OHT-ERa 
LBD complex have been deposited with the Brookhaven National Laboratory Protein Data 
Bank, and have been assigned Brookhaven Protein Data Bank Accession Numbers 2erd and 2ert 
respecuvely. This method will provide an accurate structural form for the unknown crystal more 
quickly and efficiently than attempting to determine such information ab initio. 

Atomic coordinate information gleaned from the crystals of the invention can be stored 
In a preferred embodiment, the information is provided in the form of a machine-readable data 
storage medium. This medium contains information for constructing and/or manipulating an 
atormc model of a ligand binding domain or portion thereof. For example, the machine readable 
data for the ligand binding domain comprises structure coordinates of amino acids corresponding 
to hERa Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394. Phe404 Met421 
Leu428, Gly521. His524, Leu525 and Met528, preferably Met343, Met421. His524. Leu525 and 
Met528, or a homologue of the molecule or molecular complex comprising the site The 
homologues comprise a LBD that has a root mean square deviation from the backbone atoms of 
the ammo acids of not more than 1 .5A. A preferred molecule or complex represents a compound 
bound to the LBD. 

The machine-readable data storage medium can be used for interactive drug design and 
molecular replacement studies. For example, a data storage material is encoded with a first set 
of machine-readable data that can be combined with a second set of machine-readable data For 
molecular replacement, the first set of data can comprise a Fourier transform of at least a portion 
of the structural coordinates of the nuclear receptor or portion thereof of interest, and the second 
data set comprises an X-ray diffraction pattern of the molecule or molecular complex of interest 
Using a machine programmed with instructions for using the first and second data sets a portion 
or all of the structure coordinates corresponding to the second data can be determined 
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Protein for crystals and assays described herein can be produced using expression and 
purification techniques described herein and known in the art. For example, high level 
expression of nuclear receptor LBDs can be obtained in suitable expression hosts such as E. coli. 
Expression of LBDs in E. coli. for example, includes the ERa LBD and other nuclear receptors, 
including GR and PR. Yeast and other eukaryotic expression systems can be used with nuclear 
receptors that bind heat shock proteins as these nuclear receptors are generally more difficult to 
express in bacteria, with the exception of ER. which can be expressed in bacteria. 
Representative nuclear receptors or their ligand binding domains have been cloned and 
sequenced: human ER (as described in Seielstad, et al., Molecular Endocrinology 9(6):647-658 
(1995), incorporated herein by reference), human GR, and human PR. The LBD for each of 
these receptors has been identified. 

Coactivator proteins can be expressed using techniques known in the an, particularly 
members of the pi 60 family of coactivator proteins that have been cloned and/or expressed 
previously, such as SRC-1, AIB1, RAC3, p/CIP, and GRIP1 and its homologues TIF 2 and 
NcoA-2. A preferred method for expression of coactivator protein is to express a fragment that 
retains transcriptional activation activity using the "Song and Fields" method (also referred to as 
the "yeast 2-hybrid" method) as described in publications by Hong, et al., MoL CeH. BioL 
17:2735-44 (1997) and Proc. Natl. Acad . Sd. USA 93(10):4948-52 (1996)), for GRIP1 
expression, which references are incorporated herein by reference. 

The proteins can be expressed alone, as fragments of the mature or full-length sequence, 
or as fusions to heterologous sequences. For example, ERct can be expressed without any 
portion of the DBD or amino-terminal domain. Portions of the DBD or amino-terminus can be 
included if further structural information with amino acids adjacent the LBD is desired. 
Generally, for the ERa the LBD used for crystals will be less than 320 amino acids in length. 
Preferably, the ERa LBD will be at least 220 amino acids in length and most preferably at least 
250 amino acids in length. For example the LBD used for crystallization can comprise amino 
acids spanning from 297 to 554 of the ERa. 

Typically the LBDs are purified to homogeneity for crystallization. Purity of LBDs can 
be measured with sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). 
mass spectrometry (MS) and hydrophobic high performance liquid chromatography (HPLC). 
The purified LBD for crystallization should be at least 97.5 % pure, preferably at least 99.0% 
pure, and more preferably at least 99.5% pure. 
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initially, purification of the unliganded receptor can be obtained by conventional 
techniques, such as hydrophobic interaction chromatography (HPLC). ion exchange 
chromatography (HPLC), and heparin affinity chromatography. To achieve higher purification 
for .mproved crystals of nuclear receptors, especially the estrogen receptor, the receptors can be 
l.gand-shift-purified using a column that separates the receptor according to charge, such as an 
»on exchange or hydrophobic interaction column, and then bind the eluted receptor with a ligand 
especially an agonist. The ligand induces a change in the receptor's surface charge such that the 
hganded receptor elutes at a different position than the unliganded receptor. Usually saturating 
concentrations of ligand are used in the column and the protein can be preincubated with the 
hgand prior to passing it over the column. The structural studies detailed herein indicate the 
general applicability of this technique for obtaining super-pure nuclear receptor LBDs for 
crystallization. 

Purification can also be accomplished by use of a purification handle or "tag," such as 
with a histidine amino acid engineered to reside on the end of the protein, such as on the N- 
terminus, and then using a nickel or cobalt chelation column for purification. (Janknecht Proc 
Natl Acad. Sd. US£, 88:8972-8976 (1991)) incorporated by reference. Typically purified 
LBD, such as ERa LBD, is equilibrated at a saturating concentration of ligand at a temperature 
that preserves the integrity of the protein. Ligand equilibration can be established between 2 and 
37°C, although the receptor tends to be more stable in the 2-20°C range. Preferably crystals are 
made with the hanging drop methods detailed herein. Regulated temperature control is desirable 
to .mprove crystal stability and quality. Temperatures between 4 and 25°C are generally used 
and it is often preferable to test crystallization over a range of temperatures. The crystals are 
then subjected to vapor diffusion and bombarded with x-rays to obtain x-ray diffraction pattern 
following standard procedures. 

For co-crystallization with a ligand that binds the ligand binding domain, alone or in 
conjunction with a peptide that binds to the coactivator binding site, various concentrations of 
ligands and peptides containing a sequence that binds to a coactivator binding site of a nuclear 
receptor of interest can be used in microcrystallization trials, and the appropriate compounds 
selected for further crystallization. Ligands and peptides can be assayed for binding to the ligand 
binding domain and coactivator binding sues of a nuclear receptor of interest by any number of 
techniques, including those assays described herein. For crystallization trials with the ERa 
LBD. the hanging drop vapor diffusion method is preferred. Conditions of pH. solvent and 
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solute components and concentrations and temperature can be adjusted, for instance, as 
described in the Examples. In the handing drop method, to obtain suitable crystals for x-ray 
diffraction analysis, seeding of prepared drops with microcrystals of the complex can be used. 
Collection of structural information can be determined by molecular replacement using the 
structure of the ERct LBD determined herein. The structure is refined following standard 

techniques known in the art. 

There are many uses and advantages provided by the present invention. For example, the 
methods and compositions described herein are useful for identifying peptides, peptidomimetics 
or small natural or synthetic organic molecules that modulate nuclear receptor activity. The 
compounds are useful in treating nuclear receptor-based disorders. Methods and compositions of 
the invention also find use in characterizing structure/function relationships of natural and 
synthetic ligands. 

The following discussion of provides an understanding of the basis for the examples, 
along with the results obtained and/or the conclusions reached. 

As described above, many coactivators recognize agonist bound nuclear receptor LBDs 
through the sequence LXXLL (SEQ ID NO: 1), where L is leucine and X is any amino acid (the 
NR. box). The structure of the DES-hERct LBD-GRIP1 peptide complex reveals that the 
LXXLL motif (SEQ ID NO: I) forms the core of a short amphipathic a helix which is recognized 
by a highly complementary hydrophobic groove on the surface of the receptor. In agreement 
with the conclusions of other mutational and structural studies (Brzozowski, et aL. supra and 
Feng, et al., supra), it is believed that this peptide binding groove formed by residues from 
helices 3, 4, 5 and 12 and the turn between helices 3 and 4 is the surface of ERa involved with 
transcriptional activity, i.e.. the coactivator binding site. Further, structural studies of the 
complex between TRp and the GRIP1 NR box II peptide and biochemical studies of GRIP 1 
binding to TRp and GR (Darimont, et al., supra ) and study of the general features of the 
PPARy/SRC-l peptide complex (Nolte, et al., supra ) are similar to those of the ERct/GRIPl NR 
box II peptide complex described herein, suggesting that the mechanisms of NR box recognition 
are conserved across the nuclear receptor family. 

Of the eleven AF-2 residues whose side chains interact with the coactivator helix (Figure 
3A), only four (Lys 362. Leu 379. Gin 375 and Glu 542) are highly conserved across the nuclear 
receptor family (Wurtz. et al., supra ). The side chains of Gin 375 and Leu 379 are 
predominantly buried even in the absence of GRJP1 binding and appear to form integral parts of 
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the architecture of the surface involved in transcriptional activity. In contrast, the side chains of 
Lys 362 and Glu 542 are largely solvent exposed in the absence of coactivator and make both 
nonpolar contacts and the only receptor-mediated polar interactions with the coactivator helix 
These two capping interaction residues are perfectly positioned at opposite ends of the 
> coactivator binding site groove not only to stabilize the main chain confonnation of the 

coactivator but also to function as a molecular caliper; the 15A distance between Lys 362 and 
Glu 542 is well suited to measure off the ~I . A axial length of the short, two-turn coactivator a 
helix (Figure 3C). Similar receptor-mediated capping interactions have also been observed in a 
complex between the TR P LED and the NR box II peptide (Darimont, et al., ^ Mutation of 
1 0 either of these two residues severely cripples coactivator binding by ERa as well as by TR P (see 
the Examples, Apriletti, et al., sur^a and Feng, et al.. supra and Henttu. et al.. sunra) Hence the 
formation of helix capping interactions may be a general feature of coactivator recognition by 
nuclear receptors. 

The side chains of six hydrophobic AF-2 residues (He 358, Leu 372, Val 376 Leu 379 
1 5 Leu 539 and Met 543) form a large part of the highly cooperative network of van der Waals 

contacts made by the receptor with the hydrophobic face of the coactivator helix (Figures 3A and 
3C). Mutations in He 358, Val 376 and Leu 539 abrogate GRIP, binding (see the Exam P ,es and 
Feng, et al., supra). Although these residues are, in general, more poorly conserved across 
nuclear receptors than either Lys 362 or Glu 542, their hydrophobic character, with the exception 
~0 of Leu 372, is conserved. Since the different NR LBDs adopt the same overall fold (Moras et 
al- Cur, Opin. Cejl Biol. ,0:384-91 (1998)), it follows that the hydrophobic re gl ons of different 
nuclear receptor coactivator binding site surfaces are distinctly textured. In support of this 
hypothesis, the NR box II peptide used in crystallization inhibited binding of GRIP, to the LBDs 
of the ERa, the TR P and the glucocorticoid receptor (GR) with very different efficiencies (Ding 
25 et al.. supra V v e ' 

The hydrophobic face of the NR box helix is formed by the side chains of the three motif 
leucines and the isoleucine preceding the motif (He 689). The functional importance of the 
conserved leucines in receptor binding has been demonstrated by numerous in ruro and /„ vivo 
stud,es. In contrast, the role of the residue preceding the motif in receptor binding has been 
,0 poorly characterized. Both biochemical and structural data implicate He 689 as a key receptor 
bindmg determinant. In the crystal, only the side chains of the motif leucines and He 689 
extensively contact the LBD in both noncrystallographic symmetry related peptide, Mutation of 
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He 689 to alanine reduces the ability of the peptide to inhibit the binding of GRIP1 to ERa by 
^30 fold in a competition assay (data not shown). The side chain of He 689 lies in a rather 
chemically distinct environment; this residue forms van der Waals contacts with the aliphatic 
portion of the Asp 538 side chain, the side chain of Leu 539 and the y-carboxylate of Glu 542 
(Figures 1 A and 3 A). The proximity of this negatively charged moiety of Glu 542 to the 
hydrophobic side chain of He 689 should enhance the electrostatic potential of the side chain 
carboxylate and strengthen its stabilizing interactions with the N-terminus of the coactivator 
helix. Despite its apparently crucial role in receptor recognition, the identity of the residue 
immediately preceding known NR boxes is poorly conserved. This sequence variability should 
have effects not only on packing interactions with ERa but also on both the chemical 
environment and the critically important orientation of Glu 542. This should in turn translate 
into variations in affinity for the receptor. Indeed, the three NR boxes from GRIP1. which each 
contain a different residue preceding the LXXLL motif (SEQ ID NO:l), have differing affinities 
for ERa (Ding, et al., supra ; Voegel, et al., EMBO J. 17:507-19 (1998)). 

The NR boxes of TIF2, the human homologue of GRIP 1, have been found to be partially 
functionally redundant despite their individual differences in receptor binding affinities. All 
three of NR boxes of TIF2 must be mutated to completely eliminate interaction with the ERa 
(Voegel, et al., EMBO J. 1 7:507-19 (1 998)). Our data indicate that a single NR box peptide is 
sufficient to form a tight complex with a single ERa LBD. Yet pi 60 coactivators possess 
multiple NR boxes. A possible explanation for the presence of multiple NR boxes is that they 
provide coactivators with broad specificity. Receptor binding relies upon the intricate formation 
of multiple van der Waals interactions yet the various nuclear receptors appear to have different 
coactivator binding site surfaces. The different amino acids in the position immediately 
preceding the LXXLL motif (SEQ ID NO:l) might allow some degree of adaptability to these 
distinct surfaces; however, there may be no NR box sequence that is capable of efficiently 
binding to all nuclear receptors. Multiple NR boxes may therefore provide coactivators the 
diversity of interfaces necessary to recognize a variety of targets. 

ERa transcriptional activity is blocked by antagonists such as OHT and RAL. The most 
striking feature of the structures of the OHT and RAL liganded ERa LBDs is that helix 12 is 
bound to the static region of the coactivator recognition groove (Figure 3B and (Brzozowski. et 
al., supra ). A comparison of these two structures with the structure of the coactivator/LBD 
complex reveals that in the antagonist complexes, the region of helix 12 with an NR box-like 
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sequence (LXXML versus LXXLL) (SEQ.ID NO:2 versus SEQ ID NO: 1 ) functions as an 
intramolecular mimic of the coactivator helix (Figure 5 and Brzozowski. et al., supra ). 
Consistent with the proposals of others (Brzozowski, et al.. supra and Darimont. et al., supra l 
thi:; disposition of helix 12 directly affects the structure and function of the surface responsible 
5 for transcriptional activity in two ways. First, because helix 12 residues form an integral part of 
the coactivator binding site surface, the surface is incomplete when helix 12 is in the antagonist- 
bound conformation. In particular, Leu 539, Glu 542 and Met 543 are incorrectly oriented for 
coactivator recognition. Second, residues from the static region of this surface are bound to helix 
12 and are prevented from interacting with coactivator (Figures 3 A and 3B). 
10 The sequence similarity of helix 12 of the ERa LBD to the LXXLL motif (SEQ ID 

NO:l) appears to be unique among nuclear receptors. The' identities of the residues in this region 
of helix 12 in other nuclear receptors, although generally hydrophobic in character, do not as 
closely resemble the sequence of an NR. box as those of ERa (Wurtz. et al., supra). However, it 
is possible that an intramolecular inhibitor with a suboptimal recognition sequence would 
1 5 compete for coactivator binding given its extremely high local concentration. 

The binding of OHT to the ERa promotes a helix 12 conformation that inhibits binding 
of the coactivator. OHT does not directly interact with any helix 12 residues (Figure 4B). 
Moreover, the structure of the LBD in the region of the coactivator binding site groove that 
interacts with helix 1 2 in the OHT complex is the same in the DES and E 2 complexes (Figures 
20 3A, 3B and 5). 

Numerous studies have demonstrated the importance of the OHT side chain in receptor 
antagonism (Jordan, et al., supra and Robertson, et al., J. Med. Chem . 25:167-71 (1982)). A 
comparison of the structures of the OHT and DES complexes reveals that the binding mode of 
the OHT side chain precludes the agonist-induced conformation of helix 12. The OHT side 
chain projects out of the ligand binding pocket between helices 3 and 1 1 (Figures 2B, 6B and 
6C). As a result, the positioning of helix 12 over the ligand binding pocket, as it is in the 
agonist-bound conformation, would bury the positively charged dimethylamino group of the 
OHT side chain within a hydrophobic cavity and produce steric clashes between the 
dimethylaminoethyl region of side chain and the side chain of Leu 540. 

In functional terms. OHT is not, however, simply "an agonist with a side chain". OHT 
binding promotes a conformation of the LBD that is distinct from that stabilized by either DES 
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or E 2 binding. These different conformations impose different restrictions on the positioning of 
helix 12. 

Helices 3, 8 and 1 1 in the DES and E 2 complexes are between one to two turns longer 
than they are in the OHT complex (Figure 6A and (Brzozowski, et aL supra ). Helix 1 1 ends at 
5 Cys 530 in the DES and E 2 complexes and it ends at Tyr 526 in the OHT complex. Helix 12 

begins at Leu 536 in the OHT complex. This appears to be necessary; in the antagonist complex, 
Leu 536 forms a cooperative network of nonpolar contacts and hydrogen bonds with Glu 380 
and Tyr 537 that stabilizes the N-terminus of helix 12 (Figure IB). Therefore, if helix 12 were to 
bind the static region of the coactivator binding site in the presence of agonist, the loop 
10 connecting helices 1 1 and 12 would be required to span -17 A over five residues. Although 
theoretically possible, this conformation would be highly strained and hence unlikely. In 
contrast, the longer loop connecting helices 1 1 and 12 in the OHT complex allows helix 12 to 
extend to the static region of the coactivator binding groove. 

In the DES and E 2 complexes, helix 12 and the loop connecting helices 1 1 and 12 pack 
15 against helices 3 and 11, whereas they do not in the OHT complex (Figures 2A and 2B and 

(Brzozowski, et ah, supra ). A recently described structure of the E 2 -LBD complex suggests that 
the longer helices in the DES and E 2 complexes are not dependent upon the interactions helix 12 
forms in the agonist-bound conformation (Tanenbaum, et al., supra V In this structure, a crystal 
packing artifact forces helix 12 to contact a symmetry-related molecule. Helix 12 is clearly not 
20 positioned over the ligand binding pocket in this structure. Nevertheless, helices 3, 8 and 1 1 are 
longer than they are in the OHT complex (Figure 6A). Hence the longer helices of the agonist 
complexes occur independently of the positioning of helix 12 over the ligand binding pocket and 
are instead a direct result of agonist binding. 

The secondary structure differences between the agonist complexes and the OHT 
25 complex arise from distinct arrangements of packing interactions induced by the different 
ligands. A cooperative network of van der Waals contacts, organized around DES or E 2 , 
between various hydrophobic residues from helices 3, 7, 8 and 1 1 and the p hairpin appears to 
stabilize the longer helices in the agonist complexes (Figure 4A and 6D). The placement of the 
OHT B ring forces many of ligand binding pocket residues that surround it to adopt 
30 conformations that are dramatically different from those they adopt in either the DES or E 2 

structures. As a result, many of the interresidue packing interactions present in the DES and E 2 
structures are either absent or altered in the OHT structure (Figure 6D). These structural 
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distortions apparently force the main chain from residues 339 to 341, 421 to 423. and 527 to 530 
(which form parts of helices 3, 8 and 1 1 respectively in the agonist structures ) to adopt an 
extended conformation in the OHT structure (Figures 6A-D). 

Therefore the binding of OHT has two distinct effects on the positioning of helix 12. each 
of which contributes to antagonism (Figure 7). Helix 12 is prevented from being positioned over 
the ligand binding pocket by the OHT side chain. In addition, the alternative packing 
arrangement of ligand binding pocket residues around OHT stabilizes a conformation of the 
LBD that permits helix 12 to reach the static region of the coactivator binding site and mimic 
bound coactivator. 

These mechanisms do not appear to be specific to OHT. The side chain of RAL. like that 
of OHT, sterically hinders the agonist-bound conformation of helix 12 (Brzozowski, et al.. 
supra). In addition, helix 1 1 appears to end at Met 528 in the RAL complex. This may result 
from the distortions in the binding pocket in the vicinity of His 524 directed by RAL. binding 
(Brzozowski, et al., supra ). 

These results are supported by the experimental data provided in the examples below, 
which are also intended to illustrate various aspects of this invention. These examples do not 
limit the scope of this invention. 

EXAMPLES 

2Q Example 1 

Experimental Procedures 

Protein Expr ession anH Purification 

The human ERcx-LBD 297-554 was overexpressed as described previously (Seielstad. et 
al-, Mol. Endocrinol . 9:647-658 (1995)) in BL21(DE3)pLysS cells transformed with a modified 
pET-23d-ERG vector that contained the sequence Met-Asp-Pro fused to residues 297 through 
554 of the hERct (provided by Paul Sigler of Yale University). Clarified bacterial lysates were 
adjusted to 3 M in urea and 0.7 M in NaCl and then applied to a 10-ml column of estradiol- 
Sepharose (Greene, et al., Proc. Natl. Acad. Sci. USA 77:5 1 1 5-5 1 1 9 (1 980); Landel. et al.. Mol. 

End ° crino1 - 8:1407-1419 (1994); Landel, et al.. J. Steroid Biochem . Molec . BjpJ 63 59-73 
(1997)). 

To carboxymeihylate the solvent-accessible cysteines, the bound hERa-LBD was treated 
with 5 mM iodoacetic acid in 1 0 mM Tris. pH 8. 1 . 250 mM NaSCN (Hegy, et al.. Steroids 
61:367-373 (1996)). Protein was eluted with 3 x 10-5 M ligand (either DES or OHT) in 30-100 
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ml of 50 mM Tris. 1 mM EDTA. 1 mM DTT and 250 mM NaSCN. pH 8.5. The yield of hERa- 
LBD was typically close to 100% (Seielstad. et al.. Biochemistry 34:12605-12615 (1995)). The 
affinity-purified material was concentrated and exchanged into 20 mM Tris. 1 mM EDTA. 4 mM 
DTT. pH 8.1 by ultrafiltration. The protein was bound to a Resource Q column (Pharmacia) and 
then eluted with a linear gradient of 25-350 mM NaCl in 20 mM Tris. pH 8.L 1 mM DTT. The 
hERcc-LBD-ligand complexes eluted at 150-200 mM NaCl. Pooled fractions were concentrated 
by ultrafiltration and analyzed by SDS-PAGE, native PAGE, and electrospray ionization mass 
spectrometry. 
GST-pulldown Assays 

A fusion between glutathione-S-transferase (GST) and amino acids 282-595 of hERct 
was constructed by subcloning the EcoRI fragment from pSG5 ERa-LBD (Lopez et al., 
submitted manuscript) into pGEX-3X (Pharmacia). The He 358-> Arg, Lys 362->Ala. and Leu 
539->Arg mutations were introduced into the GST-LBD construct using the QuikChange Kit 
(Stratagene) according to the manufacturer's instructions. The Val 376->Arg and Glu 542->Lys 
mutations were created in the GST-LBD construct by subcloning the Bsml/Hindlll fragments of 
derivatives of pSG5-ER-HEGO (Tora. et al., EMBO J. 8:1981-6 (1989)) into which these 
mutations had already been introduced. All constructs were verified by automated sequencing 
(University of Chicago Cancer Research Center DNA Sequencing Facility). 

The wild-type and mutant GST-LBDs were expressed in BL21(DE3) cells. Total ligand 
binding activity was determined by a controlled pore glass bead assay (Greene, et al., Mol. 
Endocrinol . 2:714-726 (1988)) and protein levels were monitored by western blotting with a 
monoclonal antibody to hERa (H222). Cleared extracts containing the GST-LBDs were 
incubated in buffer alone (50 mM Tris, pH 7.4, 150 mM NaCl, 2 mM EDTA, 1 mM DTT, 0.5% 
NP-40 and a protease inhibitor cocktail) or with 1 uM of either DES or OHT for 1 hour at 4°C. 
Extract samples containing thirty pmol of GST-LBD were then incubated with 10 ul glutathione- 
Sepharose-4B beads (Pharmacia) for 1 hour at 4°C. Beads were washed five times with 20 mM 
HEPES, pH 7.4, 400 mM NaCl, and 0.05% NP-40. 35 S-labeled GRIP1 was synthesized by in 
vitro transcription and translation using the TNT Coupled Reticulocyte Lysate System (Promega) 
according to the manufacturer's instructions and pSG5-GRIPl (provided by Michael Stallcup of 
the University of Southern California) as the template. Immobilized GST-LBDs were incubated 
for 2.5 hours with 2.5 ul aliquots of crude translation reaction mixture diluted in 300 (al of Tris- 
buffered saline (TBS). After five washes in TBS containing 0.05% NP-40. proteins were eluted 
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by boiling the beads for 10 minutes in sample buffer. Bound 35 S-GRIP1 was quantitated by 
fluorography following SDS-PAGE. 
Crystallization and Data Collection 

Crystals of the DES-hERcc LBD-GRIP1 NR Box II peptide complex were obtained by 
hanging drop vapor diffusion. Prior to crystallization, the DES-hERcc LBD (residues 297-554) 
complex was incubated with a 2-4 fold molar excess of the GRIP1 NR Box II peptide (SEQ ID 
NO:4) for 7-16 hr. Two p.L samples of this solution were mixed with equal volume samples of 
reservoir buffer consisting of 25-27% (w/v) PEG 4000. 90 mM Tris (pH 8.75-9.0) and 180 mM 
Na Acetate and suspended over wells containing 800 uL of the reservoir buffer. After 4-7 days 
at 19-21°C. rod-like crystals were obtained. The coactivator complex crystals lie in the 
spacegroup P2, with cell dimensions a=54.09, b=82.22. c=58.04 and (J=l 1 1 .34. Two molecules 
each of the DES-LBD and the coactivator peptide form the asymmetric unit. A 200 |am x 40 urn 
x 40 urn crystal was transferred to a cryosolvent solution containing 25% (w/v) PEG 4000, 10% 
(w/v) ethylene glycol, 100 mM Tris (pH 8.5), 200 mM Na Acetate and 10 M M peptide and frozen 
in an N 2 stream at -1 70°C in a rayon loop. Diffraction data from this crystal were measured at - 
170°C using a 300 mm MAR image plate at the Stanford Synchrotron Radiation Laboratory 
(SSRL) at beamline 7-1 at a wavelength of 1.08A. 

Crystals of the hERa LBD complexed to OHT were obtained by the hanging drop vapor 
diffusion method. Equal volume aliquots (2 uL) of a solution containing 3.9 mg/mL protein- 
hgand complex and the reservoir solution containing 9% (w/v) PEG 8000, 6% (w/v) ethylene 
glycol, 50 mM HEPES (pH 6.7) and 200 mM NaCl were mixed and suspended over 800 uL of 
the reservoir solution. Hexagonal plate-like crystals formed after 4-7 days at 2 1 -23 °C. Both 
crystal size and quality were improved through microseeding techniques. These crystals belong 
to the space group P6 5 22 with cell parameters a=b=58.24A and c=277.47A. The asymmetric 
unit consists of a single LBD monomer; the dimer axis lies along a crystallographic two-fold. A 
single crystal (400 urn x 250 urn x 40 urn) was briefly incubated in a cryoprotectant solution 
consisting of 10% (w/v) PEG 8000, 25% (w/v) ethylene glycol. 50 mM HEPES (pH 7.0) and 200 
mM NaCl and then flash frozen in liquid N 2 suspended in a rayon loop. Diffraction data were 
measured at -1 70°C using a 345 mm MAR image plate at SSRL at beamline 9-1 and at a 
30 wavelength of 0.98A. 
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The images of both data sets were processed with DENZO and scaled with 
SCALEPACK (Otwinowski. et al.. Methods Enzvmol . 276:307-326 (1997)) using the default - 
3a cutoff. 

Structure Determination and Refinement 
5 Initial efforts to determine the structure of the DES-LBD-GRIP1 NR Box II peptide 

complex utilized a low resolution (3.1 A) data set (data not shown). A self-rotation search 
implemented with POLARRFN ('The CCP4 suite: programs for protein crystallography", Acta 
Crvstalloer . D 50:760-763 (1994)) indicated the presence of a noncrystallographic dyad. The 
two LBDs in the asymmetric were located by molecular replacement in AMoRe (CCP4, 1994) 

10 using a partial polyalanine model of the human RARy LBD (Renaud, et ah, supra ) as the search 
probe (R=58.2%, CC=35.6% after placement of both monomers). Given that the model at this 
point was both inaccurate (r.m.s.d. 1 .7 A between this model and the final model based on Ca 
positions) and incomplete (accounting for only -45% of the total scattering matter in the 
asymmetric unit), an aggressive density modification protocol was undertaken. Iterative cycles 

15 of two-fold NCS averaging in DM (CCP4, 1994) interspersed with model building in MOLOC 
(Muller, et al., Bull . Soc . Chim . Belg. 97:655-667 (1988)) and model refinement in REFMAC 
(Murshudov, et al., Acta Crvstallogr . D 53:240-255 (1997)) (using tight NCS restraints) were 
used to quickly build a model of the LBD alone. For this procedure, MAMA (Kleywegt, et al., 
"Halloween.. .masks and bones. In From First Map to Final Model", Bailey, et al, eds., 

20 Warrington, England, SERC Daresbury Laboratory, 1994) was used for x a!l mask manipulations 
and PHASES (Furey, et al.. PA33 Am. Cryst . Assoc . Mtg. Abstr . 18:73 (1990)) and the CCP4 
suite (CCP4, 1 994) were used for the generation of structure factors and the calculation of 
weights. 

However, although the DES-LBD-GRIP1 NR Box II peptide complex model accounted 
25 for -90% of the scattering matter in the asymmetric unit, refinement was being hampered by 
severe model bias. The OHT complex data set was then collected. Starting with one of the 
monomers of the preliminary DES-LBD model as the search probe, molecular replacement in 
AMoRe was used to search for the location of LBD in this crystal form in both P6i22 and P6_<;22. 
A translation search in P6 5 22 yielded the correct solution (R~53.8%, CC=38.2%). In order to 
30 reduce model bias, DMMULTI (CCP4. 1994) was then used to project averaged density from the 
DES complex cell into the OHT complex cell. Using MOLOC a model of the LBD was built 
into the resulting density. The model was refined initially in REFMAC and later with the 
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simulated annealing, positional and B-factor refinement protocols in X-PLOR (Brunger, X- 
PLOR Version 3.843. New Haven. Connecticut: Yale University. 1996) using a maximum- 
likelihood target (Adams, et al., Proc . Natl. Acad . Sci . USA 94:5018-23 (1997)). Anisotropic 
scaling and a bulk solvent correction were used and all B-factors were refined isotropically. 
Except for the R free set (a random sampling consisting of 8% of the data set), all data between 41 
and 1.9A (with no a cutoff) were included. The final model consisted of residues 306-551, the 
ligand and 79 waters. According to PROCHECK (CCP4, 1994). 91.6% of all residues in the 
model were in the core regions of the Ramachandran plot and none were in the disallowed 
regions. 

The high resolution data set of the DES-LBD-GRIP 1 NR Box II peptide complex became 
available when the R free of the OHT-LBD model was -31%. Both monomers in the asymmetric 
unit of the DES complex crystal were relocated using AMoRe and the incompletely refined 
OHT-LBD model (with helix 12 and the loop between helices 1 1 and 12 removed) as the search 
model. The missing parts of the model were built and the rest of the model was corrected using 
MOLOC and two-fold averaged maps generated in DM. Initially, refinement was carried out 
with REFMAC using tight NCS restraints. At later stages, the model was refined without NCS 
restraints using the simulated annealing, positional and B-factor refinement protocols in X- 
PLOR and a maximum-likelihood target. All B-factors were refined isotropically and 
anisotropic scaling and a bulk solvent correction were used. The Rf ree set contained a random 
sample of 6.5% of all data. In refinement, all data between 27 and 2.03A (with no a cutoff) were 
used. The final model was composed of residues 305-549 of monomer A. residues 305-461 and 
470-554 of monomer B, residues 687-697 of peptide A, residues 686-696 of peptide B, two 
ligand molecules, 1 47 waters, two carboxymethyl groups and a chloride ion. According to 
PROCHECK, 93.7% of all residues in the model were in the core regions of the Ramachandran 
25 plot and none were in the disallowed regions. 

Figure 1 A provides a view of a 2Fo-Fc electron density map calculated at 2.03 A 
resolution and contoured at 1.0 a showing the GRJP1 NR box II interaction with the LBD. The 
GRIP1 NR Box II peptide (SEQ ID NO:4) was omitted from the model prior to map calculation. 
He 689 from the peptide and two of the three receptor residues with which it interacts (Glu 542 
and Leu 539) are labeled. Asp 538 has been omitted for clarity. The hydrogen bonds between 
the y-carboxylate of Glu 542 and the amides of residues 689 and 690 of the peptide are depicted 
as dashed orange bonds. Figure IB provides a view of a 2Fo-Fc electron density map calculated 
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at 1.90A resolution and contoured at 1.0 a showing the N-terminal region of helix 12. The 
dashed orange bonds depict the water-mediated hydrogen bond network between the imidazole 
ring of His 377. the y-carboxylate of Glu 380. and the amide of Tyr 537. The three labeled 
residues (Glu 380. Leu 536 and Tyr 537) interact with each other through van der Waals contacts 
5 and/or hydrogen bonds. Intriguingly, mutations in each these three residues dramatically 
increase the transcriptional activity of unliganded ERa LBD (Eng, et al., MpJ. Cell. Biol . 
17:4644-4653 (1997); Lazennec, et al.. Mol. Endocrinol . 1 1:1375-86 (1997); White, et al.. 
EMBO J. 16:1 427-35 ( 1 997)). 

Example 2 

10 Structure Determination 

GRIP1 , a mouse p 1 60 coactivator, interacts both in vivo and in vitro with the ERa LBD 
bound to agonist (Ding, et al., suoral but not with the LBD bound to antagonist (Norris. et al., J. 
Bid. CJiern. 273:6679-88 (1998)). Mutational studies of GRIP 1 and its human homologue TIF2 
suggest that of the three NR boxes from GRIP 1 , NR box II (residues 690 to 694) binds most 
1 5 tightly to the ERa LBD (Ding, et al., supra and Voegel, et al., supra). 

Competition assays indicate that a 13 residue GRIP1 NR Box II peptide, NH 2 - 
KHKILHRLLQDSS-C0 2 H (SEQ ID NO:4), corresponding to residues 686-698 of GRIP 1 (Ding, 
et al., supra), synthesized by standard solid phase methods, binds specifically to the agonist- 
bound ERa LBD (IC 50 <0.4uM; Kushner, unpublished) and to other agonist-bound NR LBDs 
20 (Ding, et al., supra and Darimont, at al, supra ). 

An electrophoretic mobility shift assay was used to demonstrate that the GRIP1 NR Box 
II peptide (SEQ ID NO:4) bound the ERa LBD in the presence of the agonist. DES, but not the 
antagonist, OHT. Eight microgram samples of purified hERa-LBD bound to either DES or 
OHT were incubated in the absence of the GRIP1 NR Box II peptide (SEQ ID NO:4), i.e.. buffer 
25 alone, or in the presence of either a 2-fold or 1 0-fold molar excess of the GRIP 1 NR Box II 
peptide. The binding reactions were performed on ice for 45 minutes in 10 ul of buffer 
containing 20mM Tris, pH 8.1, ImM DTT, and 200mM NaCl and then subjected to 6% native 
PAGE. Gels were stained with GELCODE Blue Stain reagent (Pierce). 

In the presence of the NR box II peptide, the migration of the DES-LBD complex was 
retarded. In contrast, peptide addition had no effect on the mobility of the OHT-LBD complex. 
Hence, this peptide fragment of GRIP 1 possesses the ligand-dependent receptor binding activity 
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characteristic of the full-length protein. These observations suggest that the GRIP1 NR Box II 
peptide is a valid model for studying the interaction between GRIP1 and the ERcx LBD. 

In order to characterize structurally the interaction between the GRIP1 NR Box II peptide 
and the ERcx LBD. recombinant human ERa LBD (residues 297-554) was crystallized bound to 
5 both DES and the GRIP1 NR Box II peptide. The ERa LBD bound to OHT was also 
crystallized in order to determine the mechanism by which this antagonist blocks 
coactivator/ERa interaction. X-ray diffraction data from these crystals were measured and the 
structures were determined by a combination of molecular replacement (using a modified 
version of the coordinates of the human retinoic acid receptor y (RARy) LBD. Renaud, et al., 

10 suora . as the search model) and aggressive density modification. 

The structure of the DES-ERa LBD-GRIP1 NR Box II peptide complex has been refined 
to a crystallographic R-factor of 19.9% (R free =25.0%) using data to 2.03 A resolution, as shown 
in Figure 1 A and Table 2. The structure of the OHT-ERa LBD complex has been refined using 
data to 1 .90A to a crystallographic R-factor of 23.0% (R frec =26.2%), as shown in Figure IB and 

15 Table 2. 

Table 2 



Summary of Crystallographic Statistics 

Ligand 





Data Collection 


DES 


OHT 


20 


Space group 


P2, 


P6 S 22 




Resolution 


2.03 


1.90 




Observations 


104189 


269253 




Unique 


30265 


23064 




Completeness (%) 


98.4 


99.1 


25 


Rsym(%) a 


7.8 


7.0 




Average I/al 


9.8 


16.1 




Refinement 








Number of non-hydrogen atoms 


4180 


2070 




Rcys, (%) b /Rfree (%) 


19.9/25.0 


23.0/26.1 


30 


Bond r.m.s. deviation (A) 


0.006 


0.006 




Angle r.m.s. deviation (°) 


1.05 


1.05 




Average B factor (A 2 ) 


34.0 


40.4 




Rsym =Si I li- <li> I 


/Zili where <Ij> 


is the averaj 




intensity over symmetry equivalent: 
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Example 3 

Overall Structure of the DES-LBD-GRIPl NR Box II Peptide Complex 
The asymmetric unit of the DES-LBD-GRIPl NR Box II peptide complex crystals 
contains the same noncrystallographic dimer of LBDs that has been observed in the previously 
5 determined structures of the LBD bound to both E 2 and RAL (Brzozowski, et al., supra and 

Tanenbaum, et al., supra ). Beyond the flexible loops between helices 2 and 3 and helices 9 and 
10, the two LBDs of the dimer adopt similar structures (r.m.s.d. 0.47A based on Cct positions). 
The conformation of each LBD complexed with DES closely resembles that of the LBD bound 
to E 2 (Brzozowski, et al.. supra ); each monomer is a wedge shaped molecule consisting of three 

10 layers of eleven to twelve helices and a single beta hairpin (Figure 2A). In each LBD, the 

hydrophobic face of helix 12 is packed against helices 3. 5/6 and 1 1 covering the ligand binding 
pocket (Figure 2A). One GRIP1 NR Box II peptide is bound to each LBD in a hydrophobic cleft 
composed of residues from helices 3, 4, 5 and 12 and the turn between 3 and 4 (Figures 2 A and 
3A). The density for both peptides in the asymmetric unit is continuous and unambiguous 

15 (Figure 1 A). Residues 687 to 697 from the GRIP1 NR Box II peptide, designated peptide A, and 
residues 686 to 696 from the GRIP1 NR Box II peptide, designated peptide B, have been 
modeled; the remaining residues are disordered. Given that each peptide lies within a different 
environment within the crystal, it is striking that from residues He 689 to Gin 695 each peptide 
forms a two turn, amphipathic a helix (Figures 2 A and 3 A). Flanking this region of common 

20 secondary structure, the peptides adopt dissimilar random coil conformations. 

The overall structures of the DES-ERa LBD-GRJP1 NR Box II peptide complex and the 
OHT-ERa LBD complex are illustrated in Figure 2. In Figure 2A, the coactivator peptide and 
the LBD are shown as ribbon drawings. The peptide is colored gold and helix 12 (residues 538- 
546) is colored magenta. Helices 3, 4 and 5 (labeled H3, H4 and H5 respectively) are colored 

25 blue. DES, colored green, is shown in space-filling representation. In Figure 2B, the LBD is 
depicted as a ribbon drawing. As in Figure 2A, helix 12 (residues 536-544) is colored in 
magenta and helices 3. 4 and 5 are colored blue. OHT, in red, is shown in space-filling 
representation. 

Example 4 

50 The NR Box II Peptide-LBD Interface 

The binding of the GRIP 1 NR Box II peptide to the ERa LBD buries 1000A 2 of 
predominantly hydrophobic surface area from both molecules. The GRJP1 NR Box II peptide 
binding site is a shallow groove composed of residues Leu 354. Val 355. He 358. Ala 361 and 
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Lys 362 from helix 3; Phe 367 and Val 368 from helix 4; Leu 372 from the turn between helices 
3 and 4; Gin 375 ? Val 376. Leu 379 and Glu 380 from helix 5: and Asp 538. Leu 539. Glu 542 
and Met 543 from helix 12 (Figure 3 A). The floor and sides of this groove are completely 
nonpolar. but the ends of this groove are charged (Figure 3C). 

5 The LBD interacts primarily with the hydrophobic face of the GRIP1 NR Box II peptide 

a helix formed by the -side chains of He 689 and the three LXXLL motif (SEQ ID NO:l) 
leucines (Leu 690, Leu 693 and Leu 694). The side chain of Leu 690 is deeply embedded within 
the groove and forms van der Waals contacts with the side chains of lie 358, Val 376, Leu 379. 
Glu 380 and Met 543 (Figure 3 A and 3C). The side chain of Leu 694 is similarly isolated within 

10 the groove and makes van der Waals contacts with the side chains of He 358, Lys 362, Leu 372, 
Gin 375, Val 376 and Leu 379 (Figure 3 A and 3C). In contrast, the side chains of both He 689 
and the second NR box leucine. Leu 693. rest against the rim of the groove (Figure 3A and 3C). 
The side chain of lie 689 lies in a shallow depression formed by the side chains of Asp 538, Leu 
539 and Glu 542. The side chain of Leu 693 makes nonpolar contacts with the side chains of He 

15 358 and Leu 539. 

The charged and polar side chains which form the hydrophilic face of the peptide helix 
project away from the receptor and either interact predominantly with solvent or form symmetry 
contacts (Figure 1 A). None of the side chains of the polar and charged residues outside the 
helical region of either peptide in the asymmetric unit, with the exception of Lys 688 of peptide 

20 B, is involved in hydrogen bonds or salt bridges with its associated LBD monomer. The e-amino 
group of Lys 688 of peptide B hydrogen bonds to the side chain carboxylate of Glu 380 of 
monomer B. This interaction is presumably a crystal artifact; the main chain atoms of the N- 
terminal three residues of peptide B are displaced from monomer B and interact extensively with 
a symmetry-related LBD. 

25 In addition to interacting with the hydrophobic face of the peptide helix, the LBD 

stabilizes the main chain conformation of the NR box peptide by forming capping interactions 
with both ends of the peptide helix. Glu 542 and Lys 362 are positioned at opposite ends of the 
peptide binding site (Figure 3 A). The side chains of Glu 542 and Lys 362 form van der Waals 
contacts with main chain and side chain atoms at the N- and C-terminal turns of the peptide helix 

30 respectively . These interactions position the stabilizing charges of the y-carboxylate of Glu 542 
and e-amino group of Lys 362 near the ends of the GRIP1 NR Box II peptide helix (Figure 3C). 
The side chain carboxylate (y-carboxylate) of Glu 542 hydrogen bonds to the amides of the 
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residues ot'N-terminal turn of the peptide helix (residues 688 and 689 of peptide A; residues 689 
and 690 of peptide B) (Figure 1 A). Similarly, the e-amino group of Lys 362 hydrogen bonds to 
the carbonyls of the residues of the C-terminal turn of the peptide helix (residue 693 of peptide 
A; residues 693 and 694 of peptide B) (Figure 5). 
5 To test the importance of the GRIP1 NR Box II peptide/LBD interface observed in the 

crystal, a series of site-directed mutations were introduced into the ERct LBD. These mutations 
were designed either to simultaneously perturb the structural integrity and the nonpolar character 
of the floor of the binding groove (He 358->Arg, Val 376->Arg and Leu 539->Arg) or to prevent 
the formation of the capping interactions (Lys 362->Ala and Glu 542->Lys). Fusions of 

10 glutathione-S-transferase (GST) to the wild-type and mutant LBDs were analyzed for their 

ability to bind 35 S-labeled GRIP1 in the absence of ligand or in the presence of DES or OHT. 

3:> S-labeled GRIP1 was incubated with either immobilized GST, immobilized wild type 
GST-hERa LBD, or immobilized mutant GST-LBDs in the absence of ligand or in the presence 
of DES or OHT. The bound GRIP1 was quantitated after SDS-PAGE. I358R, mutant LBD 

15 containing a Ile->Arg substitution at residue 358; K362A, mutant LBD containing a Lys->Ala 
substitution at residue 362; V376R, mutant LBD containing a Val->Arg substitution at residue 
376; L539R, mutant LBD containing a Leu->Arg substitution at residue 539; E542K, mutant 
LBD containing a Glu->Lys substitution at residue 542. 

In the absence of ligand or in the presence of OHT, fusions to the wild-type protein and 

20 all of the mutant LBDs showed no detectable binding to GRIP1. The He 358->Arg, Val 376- 
>Arg and Leu 539->Arg mutants were all unable to interact with coactivator in the presence of 
agonist, confirming the importance of the packing interactions observed in the crystal. 
Disruption of either the N- or C-terminal capping interaction also compromised GRIP I binding 
in the presence of agonist. Only the wild-type GST-LBD was able to recognize the coactivator 

25 in the presence of DES. 

Example 5 
Agonist Recognition 

Despite its different shape and larger molecular volume. DES (273 A 3 ) is accommodated 
within the same binding pocket that recognizes E 2 (252A 3 ). In its receptor complex. DES is 
30 completely encased within the narrower half of the LBD in a predominantly hydrophobic cavity 
composed of residues from helices 3, 6. 7, S. 1 1 . and 1 2 as well as the S 1/S2 hairpin (Figures 2 A 
and4A). 
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The interaction of DES with ERa resembles that of E 2 . One of the phenolic rings of DES 
lies in the same position as the E 2 A ring near helices 3 and 6. Like the aromatic ring of the E 2 , 
the DES A ring (Figure 4A) is engaged by the side chains of Phe 404. Ala 350, Leu 387 and Leu 
391 with its phenolic hydroxyl forming hydrogen bonds to the y-carboxylate of Glu 353. to the 
5 guanidinium group of Arg 394. and to a structurally conserved water molecule. The A' ring of 
DES (Figure 4 A) is bound near helices 7. 8 and 1 1 adjacent to the location of the E 2 C and D 
rings. This ring forms van der Waals contacts not only with Gly 521 and Leu 525, like the D 
ring of E 2 , but also with Met 343. Leu 346 and Met 421 (Figure 4A). Even though it is located 
1 -7A from the position of the D ring hydroxyl, the DES A' ring phenolic hydroxyl is still able to 
1 0 hydrogen bond to the imidazole ring of His 524 (Figure 4A). 

DES also forms contacts with the LBD that E 2 does not. There are unoccupied cavities 
adjacent to the a face of the B ring and the p face of the C ring of the E 2 (Brzozowski. et al.. 
suEia and Tanenbaum. et al., supra). The ethyl groups of DES. which project perpendicularly 
from the plane of the phenolic rings, fit snugly into these spaces. The resulting additional 
nonpolar contacts with the side chains of Ala 350, Leu 384. Phe 404, and Leu 428 (Figure 4A) 
may account for the higher affinity of DES for the receptor (Kuiper, et al.. Endocrinology 
138:863-70(1997)). 

Except for Met 421 and Met 528 (both of which contact only DES) and Met 388 and He 
424 (both of which contact only E 2 ), the ER is able to use the same residues to form all of the 
observed hydrogen bonds and van der Waals contacts with both agonists (Figure 4A, 
Brzozowski. et al.. suEra, and Tanenbaum. et al., supra). This remarkable adaptability is 
presumably the result of both relatively large molecular volume of the binding pocket (-500A 3 in 
both complexes) and its apparent structural plasticity. In particular, at the DES A' ring/steroid D 
ring end of the binding pocket. Met 343, Met 421. His 524 and Met 528 adopt different packing 
configurations in response to each ligand (data not shown). This plasticity may be necessary to 
allow the receptor to recognize endogenous estrogens such as estrone and estriol, which differ 
structurally from E 2 at the D ring. 
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Example 6 
Structure of th p QHT-LBD Complex 

The binding of OHT induces a conformation of the LBD that differs in both secondary 

and tertiary structural organization from that driven by DES binding. In the DES complex, the 

main chain from residues 339 to 341. 421 to 423. and 527 to 530 form parts of helices 3 8 and 
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1 1 respectively. In contrast, these regions adopt an extended conformation in the OHT complex 
(Figures 2 A. 2B and 6 A). In addition, the composition and orientation of helix 12 are different 
in the two structures. Helix 12 in the DES complex consists of residues 538 to 546 whereas 
helix 1 2 in the OHT complex consists of residues 536 to 544. Most dramatically, rather than 
5 covering the ligand binding pocket as it does in the DES complex, helix 1 2 in the OHT complex 
occupies the part of the coactivator binding groove formed by residues from helices 3, 4. and 5, 
and the turn connecting helices 3 and 4 (Figures 2A, 2B and 3B). This alternative conformation 
of helix 12 appears to be similar to that observed in the RAL complex (Brzozowski, et al., 
supra ). 

I o Example 7 

Helix 12-LBD Interface 

Except for the orientation of helix 12. the structure of the peptide binding groove is 

almost identical in the DES and OHT complexes (Figures 3 A and 3B). The region of this groove 

outside of helix 12 is referred to herein as the "static region" of the NR box binding site. Helix 

15 1 2 in the OHT complex and the NR box peptide helix in the DES complex interact with the static 

region of the coactivator recognition groove in strikingly similar ways. 

Helix 12 mimics the hydrophobic interactions of the NR box peptide with the static 

region of the groove with a stretch of residues (residues 540 to 544) that resembles an NR box 

(LLEML instead of LXXLL) (SEQ ID NO:3 instead of SEQ ID NO:l). The side chains of Leu 

20 540 and Met 543 lie in approximately the same locations as those of the first and second motif 

leucines (Leu 690 and Leu 693) in the peptide complex (Figure 5). Leu 540 is inserted into the 

groove and makes van der Waals contacts with Leu 354, Val 376 and Glu 380 (Figures 3B and 

3D). Met 543 lies along the edge of the groove and forms van der Waals contacts with the side 

chains of Leu 354, Val 355 and He 358 (Figures 3B and 3D). The side chain position of Leu 544 

25 almost exactly overlaps that of the third NR box leucine, Leu 694 (Figure 5). Deep within the 

groove, the Leu 544 side chain makes van der Waals contacts with the side chains of He 358, Lys 

362. Leu 372, Gin 375, Val 376 and Leu 379 (Figures 3B and 3D). 

Helix 12 in the OHT complex is also stabilized by N- and C-terminal capping 

interactions. Lys 362 interacts with the C-terminal turn of helix 12 much as it does with the 

30 equivalent turn of the peptide helix (Figures 3 A and 3B). The Lys 362 side chain packs against 

the C-terminal turn of the helix 12 with its e-amino group hydrogen bonding to the carbonyls of 

residues 543 and 544 (Figure 5). Given that the capping interaction at the N-terminal turn 

coactivator helix is formed bv a helix 12 residue (Glu 542). the N-terminal turn of helix 12 in the 

46 



BNSDOCID: <WO 99506 58A2J_> 



WO 99/50658 



PCT/US99/06937 



10 



15 



20 



25 



30 



antagonist complex is forced to interact with another residue. Glu 380 (Figures 3B and 3D). The 
Glu 380 y-carboxylate forms van der Waals contacts with Tyr 537 and interacts with the amide 
of Tyr 537 through a series of water-mediated hydrogen bonds (Figure IB). 

In addition to forming these "NR. box-like" interactions, helix 12 also forms van der 
Waals contacts with areas of the LBD outside of the coactivator recognition groove. The side 
chain of Leu 536 forms van der Waals contacts with Glu 380 and Tip 383 and that of Tyr 537 
forms van der Waals contacts with His 373, Val 376 and Glu 380 (Figures IB. 3B and 3D). As a 
result of the5e contacts, helix 12 in the OHT complex buries more solvent accessible surface area 
(-1200A 2 ) than the NR box peptide in the DES complex. 

Example 8 
OHT Recognition 

OHT is bound within the same pocket that recognizes DES, E 2 and RAL. The orientation 

of OHT within the binding pocket appears to be dictated by the positioning of two structural 

features of this ligand, the phenolic A ring and the bulky side chain (Figures 4B and 6C). The A 

ring of OHT is bound in approximately the same location as the A ring of DES near helices 3 

and 6 with its phenolic hydroxyl hydrogen bonding to a structurally conserved water and to the 

side chains of Glu 353 and Arg 394 (Figure 4B). Like the bulky side chain of RAL, the side 

chain of OHT exits the binding pocket between helices 3 and 1 1 (Figures 2B and 4B). The OHT 

C ring (Figure 4B) forms van der Waals contacts with the side chains of Met 343, Leu 346, Thr 

347, Ala 350, Tip 383. Leu 384, Leu 387 and Leu 525. The positioning of the flexible 

dimethylaminoethyl region of the side chain is stabilized by van der Waals contacts with Thr 

347, Ala 350 and Trp 383 and by a salt-bridge between the dimethylamino group of the side 

chain and the (J-carboxylate of Asp 351, which lies 3.8A away (Figure 4B). The positions of the 

A ring and the side chain in the context of the rigid triphenylethylene framework of OHT 

requires that the ethylene group of OHT lie in an orientation nearly orthogonal to that of the 

ethylene group of DES (Figures 4A, 4B and 6D). As a result, the B ring of OHT is driven more 

deeply into the binding pocket than' the A' ring of DES (Figures 6B and 6C). 

This location of the OHT B ring apparently cannot be accommodated by the same 

mechanisms that allow the DES A ring/E 2 D ring end of the binding pocket to adapt to the 

different structural features of DES and E,. Instead, the residues that contact the B ring (Met 

343, Leu 346. Met 421. lie 424, Gly 521, His 524 and Leu 525). most of which also interact with 

the A ring of DES. adopt conformations distinct from the ones they adopt in the DES structure 

(Figure 6D). In fact, the location of the B ring actually precludes the side chain of one residue 
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Met 421. from adopting the same conformation that it adopts in the DES structure (Figures 6B 
and 6CV As a consequence of these B ring-induced side chain conformations, many interresidue 
van der Waals contacts present in the DES complex are absent in the OHT complex. For 
example, whereas Met 421 packs against His 524 from helix 1 1 and against Met 343 from helix 
5 3 in the agonist complexes, it is precluded by the location of the OHT B ring from interacting 
with either of these residues in the antagonist complex (Figure 6D). 

The structural effects of the placement of the B ring are not limited to the residues that 
contact the B ring; the conformations of these residues force other residues throughout the 
binding pocket to, in turn, adopt alternative conformations. For instance, the conformation 
1 0 adopted by Met 42 1 in the OHT complex prevents the side chains of Phe 404 and Phe 425 from 
occupying the positions they take in the DES complex (Figure 6B and 6C). As a consequence, 
Phe 404 does not make van der Waals contacts with the OHT A ring as it does with the A rings 
of DES or E 2 (Figure 6C). In fact. Phe 404 only contacts the ethyl group of OHT (Figures 6C 
and 6D). The alternative conformations of the side chains of both the residues that directly 
15 contact the B ring and those that are indirectly affected by it, force the main chain throughout the 
binding pocket to adopt a different conformation as well (Figure 6D). 

Identification and characterization of key residues within ligand binding domain of the 
ERa and extension of this information to other nuclear receptors shows that these residues are 
common for all nuclear receptors identified to date. Thus, the Examples presented herein 
20 demonstrate that information derived from the structure and function of the ERa ligand binding 
domain can be applied in design and selection of compounds that modulate binding of 
compounds to nuclear receptors for all members of the nuclear receptor family. 

All publications and patent applications mentioned in this specification are herein 
incorporated by reference to the same extent as if each individual publication or patent 
25 application was specifically and individually indicated to be incorporated by reference. 

The invention now being fully described, it will be apparent to one of ordinary skill in the 
art that many changes and modifications can be made thereto without departing from the spirit or 
scope of the appended claims. 
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Appendix 1 

Atomic Coordinates for Human ERa Complexed With DES and a GRJP1 NR. Box II Peptide 
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ATOM 412 O ASN A 359 

ATOM 413 N TRP A 360 

ATOM 414 CA TRP A 360 

ATOM 415 CB TRP A 360 

ATOM 416 CG TRP A 360 

ATOM 417 CD2 TRP A 360 

ATOM 418 CE2 TRP A 360 

ATOM 419 CE3 TRP A 360 



14.560 -8.490 -10.302 1.00 "»•> 50 

15.551 -9.343 -10.935 1.00 ^7*66 

15.353 -9.365 -12.466 1.00 ^4 35 

16.435 -10.214 -13.119 1.00 ^8 16 

13.957 -9.886 -12.798 1.00 ^1*59 

16.944 -8.811 -10.606 1.00 23 74 

17.857 -9.581 -10.291 1.00 ^351 

17.105 -7.489 -10.669 1.00 "M ?7 

18.392 -6.861 -10.369 1.00 ^1 31 

18.384 -5.390 -10.811 1.00 19 87 

18.494 -5.205 -12.295 1.00 V77 

17.543 -5.048 -13.248 1.00 *>1 66 

19.704 -5.177 -12.955 1.00 Vll 

19.496 -5.011 -14.249 1.00 ?4 96 

18.192 -4.931 -14.455 1.00 18 37 

18.702 -6.947 -8.875 1.00 ->141 

19.864 -7.111 -8.465 1.00 ">I 88 

17.660 -6.843 -8.058 1.00 ->\ 84 

17.837 -6.906 -6.610 1.00 ->l 51 

16.503 -6.668 -5.898 1.00 77*60 

16.629 -6.579 -4.369 1.00 19 36 

15.051 -6.755 -3.531 1.00 ~>3 64 

14.189 -5.332 -4.163 1.00 ?313 

18.411 -8.259 -6.192 1.00 ->3 69 

19.337 -8.328 -5.389 1.00 24 41 

17.856 -9.331 -6.746 1.00 07 14 

18.314 -10.672 -6.425 1.00 78 79 

17.529 -11.725 -7.232 1.00 32 42 

18.267 -13.064 -7.220 1.00 37*77 

16.125 -11.880 -6.644 1 00 31 94 

15.062 -12.196 -7.680 1.00 34 85 

19.801 -10.802 -6.728 1.00 ?8 75 

20.569 -11.305 -5.912 1.00 31 60 

20.207 -10.325 -7.897 1.00 ">7 9l 

21.601 -10.401 -8.293 1.00 ~>9 16 

21.721 -10.172 -9.801 1.00 31 88 

21.253 -11.381 -10.599 1.00 39 34 

21.916 -12.422 -10.612 1.00 41 27 

20.102 -11.255 -11.253 1.00 38 58 

22.476 -9.436 -7.510 1.00 30 75 

23.686 -9.629 -7.412 1.00 33 35 

21.872 -8.400 -6.940 1.00 30 07 

22.634 -7.451 -6.132 1.00 ->7 87 

21.849 -6.150 -5.948 1.00 24 80 

22.196 -5.392 -4.691 1.00 ^3 04 

21.501 -5.443 -3.438 1.00 19 83 

22.147 -4.543 -2.564 100 2^31 

20.392 -6.165 -2.972 1.00 ^0 09 



99506 58A2J_> 



PCT/US99/06937 

WO 99/50658 



5 



ATOM 


420 


CD1 


TRP 


A 


360 


ATOM 


421 


NE1 


TRP 


A 


360 


ATOM 


422 


CZ2 


TRP 


A 


360 


ATOM 


423 


CZ3 


TRP 


A 


360 


ATOM 


424 


CH2 


TRP 


A 


360 


ATOM 


425 


c 


TRP 


A 


360 


ATOM 


426 


O 


TRP 


A 


360 


ATOM 


427 


N 


ALA 


A 


361 


ATOM 


428 


CA 


ALA 


A 


361 


ATOM 


429 


CB 


ALA 


A 


361 


ATOM 


430 


c 


ALA 


A 


361 


ATOM 


431 


o 


ALA 


A 


361 


ATOM 


432 


N 


LYS 


A 


362 


ATOM 


433 


CA 


LYS 


A 


362 


ATOM 


434 


CB 


LYS 


A 


362 


ATOM 


435 


CG 


LYS 


A 


362 


ATOM 


436 


CD 


LYS 


A 


362 


ATOM 


437 


CE 


LYS 


A 


362 


ATOM 


438 


NZ 


LYS 


A 


362 


ATOM 


439 


C 


LYS 


A 


362 


ATOM 


440 


O 


LYS 


A 


362 



ATOM 441 N AARGA 363 

ATOM 442 N BARG A 363 

ATOM 443 CA AARGA 363 

25 ATOM 444 CA BARG A 363 

ATOM 445 CB AARGA 363 

ATOM 446 CB BARG A 363 

ATOM 447 CG AARGA 363 

ATOM 448 CG BARG A 363 

30 ATOM 449 CD AARGA 363 

ATOM 450 CD BARG A 363 

ATOM 451 NE AARGA 363 

ATOM 452 NE BARG A 363 

ATOM 453 CZ AARGA 363 

35 ATOM 454 CZ BARG A 363 

ATOM 455 NH1AARG A 363 

ATOM 456 NH1BARG A 363 

ATOM 457 NH2AARG A 363 

ATOM 458 NH2BARG A 363 

40 ATOM 459 C AARGA 363 

ATOM 460 C BARG A 363 

ATOM 461 O AARGA 363 

ATOM 462 O BARG A 363 

ATOM 463 N VAL A 364 

45 ATOM 464 CA VAL A 364 

ATOM 465 CB VAL A 364 

ATOM 466 CGI VAL A 364 

ATOM 467 CG2 VAL A 364 
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-4.340 


-1.243 


1.00 


20.43 


19.968 


-5.965 


-1.661 


1.00 


20.12 


20.635 


-5.057 


-0.812 


1.00 


18.54 


22.892 


-8.099 


-4.766 


1.00 


24.88 


23.978 


-7.980 


-4.198 


1.00 


25.00 


21.879 


-8.789 


-4.252 


1.00 


24.08 


21.972 


-9.462 


-2.958 


1.00 


26.06 


20.676 


-10.203 


-2.672 


1.00 


20.27 


23.161 


-10.433 


-2.897 


1.00 


28.44 


23.843 


-10.531 


-1.876 


1.00 


28.95 


23.414 


-11.144 


-3.992 


1.00 


31.41 


24.530 


-12.097 


-4.047 


1.00 


33.33 


24.564 


-12.824 


-5.390 


1.00 


34.81 


23.319 


-13.608 


-5.756 


1.00 


36.27 


23.458 


-14.178 


-7.167 


1.00 


38.30 


22.369 


-15.193 


-7.472 


1.00 


40.94 


22.111 


-15.322 


-8.937 


1.00 


42.49 


25.854 


-11.351 


-3.893 


1.00 


34.17 


26.880 


-11.977 


-3.595 


1.00 


35.40 


25.826 


-10.059 


-4.095 


0.50 


34.23 


25.826 


-10.059 


-4.095 


0.50 


34.03 



27.035 


-9.254 


-3.987 


0.50 


33.25 


27.035 


-9.254 


-3.987 


0.50 


32.83 


27.031 


-8.153 


-5.044 


0.50 


34.67 


27.031 


-8.153 


-5.045 


0.50 


34.20 


26.933 


-8.654 


-6.478 


0.50 


36.32 


26.930 


-8.654 


-6.480 


0.50 


35.56 


27.745 


-7.775 


-7.415 


0.50 


38.39 


27.752 


-7.781 


-7.414 


0.50 


37.18 


29.171 


-7.793 


-7.091 


0.50 


39.98 


27.195 


-7.725 


-8.762 


0.50 


37.39 


30.086 


-7.038 


-7.692 


0.50 


40.54 


27.905 


-7.457 


-9.855 


0.50 


40.02 


29.735 


-6.218 


-8.675 


0.50 


38.13 


29.205 


-7.191 


-9.761 


0.50 


40.42 


31.358 


-7.123 
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-8.798 
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34.97 


25.036 


-7.608 


1.700 


1.00 


35.44 


23.946 


-7.328 


-0.532 


1.00 


36.69 
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27.184 -9.157 0.428 1.00 "4 97 

26.878 -10.341 0.603 1.00 34 95 

28.306 -8.626 0.935 1.00 36 08 
28.775 -7.235 0.793 1.00 34*84 
29.231 -9.442 1.733 1.00 37 8? 
30.110 -8.408 2.430 1.00 34 3? 
30.127 -7.247 1.475 1.00 37 77 
28.538 -10.373 2.720 1.00 37 61 
27.692 -9.945 3.507 1.00 37 74 
28.890 -11.654 2.654 1.00 39 04 

28.307 -12.635 3.554 1.00 38 27 
26.991 -13.264 3.138 1.00 3932 
26.638 -14.336 3.635 1.00 39 53 
26.246 -12.615 2.236 1.00 38 60 
24.960 -13.148 1.783 1.00 36 36 
24.281 -12.178 0.808 1.00 3?'l0 
22.827 -12.473 0.581 1.00 30 12 
22.401 -13.083 -0.596 1.00 ^8 95 
21.882 -12.176 1.563 1.00 ^6 18 
21.050 -13.400 -0.792 1.00 ^9 42 
20.535 -12.491 1.373 1.00 27 60 
20.118 -13.103 0.196 1.00 26 81 
25.072 -14.519 1.117 1.00 36 82 
24.244 -15.398 1.359 1.00 36 55 
26.088 -14.694 0.276 1.00 38 28 
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ATOM 546 C ASP A 374 

ATOM 547 O ASP A 374 

ATOM 548 N GLN A 375 

ATOM 549 CA GLN A 375 

ATOM 550 CB GLN A 375 

ATOM 551 CG GLN A 375 

ATOM 552 CD GLN A 375 

ATOM 553 OE1 GLN A 375 

ATOM 554 NE2 GLN A 375 

ATOM 555 C GLN A 375 

ATOM 556 O GLN A 375 

ATOM 557 N VAL A 376 

ATOM 558 CA VAL A 376 

ATOM 559 CB VAL A 376 

ATOM 560 CGI VAL A 376 

ATOM 561 CG2 VAL A 376 

ATOM 562 C VAL A 376 

ATOM 563 O VAL A 376 
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16.599- -21.353 2.100 1.00 42.23 

16.318 -22.525 3.062 1.00 45.38 

15.114 -22.315 3.934 1.00 51.43 

13.808 -22.621 3.743 1.00 54.99 

15.187 -21.716 5.174 1.00 54.26 

13.979 -21.663 5.709 1.00 53.77 

13 124 -22.206 4.861 1.00 55.27 

16 665 -20.047 2.885 1.00 39.78 
15 677 -19.324 2.971 1 .00 37.71 

17 839 -19.738 3.440 1.00 36.38 
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-15.221 -2.569 14.634 1.00 46 51 
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-15.065 -9.692 12.016 1.00 42 88 

-16.274 -9.741 11.807 1.00 40 15 

-14.360 -10.733 12.447 1.00 41 92 

-14.980 -12.023 12.728 1.00 42 60 

-13.907 -13.091 12.927 1.00 44 77 

-15.844 -11.907 13.977 1.00 44 43 

-16.623 -12.804 14.296 1.00 44.09 
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-14.748 
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17.48 
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16.43 

15.57 
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HETATM 4149 


o 


HOH 


117 


HETATM 4150 


o 


HOH 


118 


HETATM 4151 


o 


HOH 


119 


HETATM 4152 


o 


HOH 


120 


HETATM 4153 


o 


HOH 


121 


HETATM 4154 


o 


HOH 


122 


HETATM 4155 


0 


HOH 


123 


HETATM 4156 


o 


HOH 


124 


HETATM 4157 


o 


HOH 


125 


T_TTTT \T\ A >I1 CO 

McIAIM 4158 


0 


HOH 


126 


HETATM 4159 


o 


HOH 


127 


HETATM 4160 


0 


HOH 


128 


HETATM 4161 


o 


HOH 


129 


HETATM 4162 


o 


HOH 


130 


HETATM 4163 


0 


HOH 
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36.525 


0.463 


20.792 


1.00 


45.26 


13.832 


9.696 


13.792 


1.00 


33.12 


31.166 


6.635 


24.924 


1.00 


35.19 


8.844 


-10.389 


34.180 


1.00 


48.80 


9.581 


-6.956 


34.136 


1.00 


42.95 


-1.563 


15.887 


27.596 


1.00 


39.35 


-5.286 


10.345 


32.757 


1.00 


35.20 


15.035 


0.607 


13.339 


1.00 


29.53. 


-10.984 


-1.500 


30.272 


1.00 


29.84 


-7.239 


-0.271 


-1.207 


1.00 


48.98 


18.022 


-4.902 


34.286 


1.00 


35.28 


29.347 


-6.319 


19.920 


1.00 


37.20 


-14.309 


-19.369 


20.945 


1.00 


30.23 


31.496 


4.614 


18.716 


1. 00 


38.79 


26.567 


9.759 


25.629 


1.00 


29.72 


2.848 


14.531 


1.134 


1.00 


38 08 


-9.373 


5.699 


-7.953 


1.00 


53.23 


-10.137 


-0.553 


-6.742 


1.00 


47 T> 


10.558 


-10.363 


15.403 


1.00 


40 97 


21.079 


17.166 


18.929 


1.00 




25.810 


-5.921 


22.506 


1.00 


37 69 


22.493 


-1.311 


34.465 


1.00 


49 94 


19.317 


10.977 


38.703 


1.00 


40.60 


4.479 


13.951 


3.045 


1.00 


45 31 


20.418 


19.353 


34.044 


1.00 


42.18 


-3.065 


8.936 


14.062 


1.00 


38.41 


26.856 


-4.674 


-10.940 


1. 00 


55.67 


2.032 


-6.387 


5.614 


1.00 


42 23 


0.601 


0.228 


-17.268 


1.00 


40.57 


4.903 


13.488 


-14.050 


1.00 


47.72 


3.986 


16.140 


-0.960 


1.00 


40.66 


12.968 


-1 9.561 


2.741 


1.00 


40.76 


7.170 


15.583 


2.599 


1.00 


43.69 


-1.966 


10.606 


3.572 


1.00 


52.63 


29.030 


10.644 


6.707 


1.00 


42.54 


0.468 


4.354 


8.374 


1.00 


38.69 


29.086 


17.119 


19.272 


1.00 


45.51 


24.614 


1 7.609 


20.174 


1.00 


53.55 


-15.318 


0.362 


26.686 


1.00 


36.77 


-3.857 


-24.786 


28.325 


1.00 


39.64 


21.728 


22.178 


31.983 


1.00 


43.73 


31.650 


-7.370 


21.642 


1.00 


40.53 


25.421 


10.436 


21.161 


1.00 


32.31 


10.317 


-9.457 


12.998 


1.00 


37.77 


22.723 


14.887 


15.427 


1.00 


47.90 


6.702 


9.556 


37.596 


1.00 


47.81 


27.987 


13.557 


7.167 


1.00 


41.15 


30.798 


16.499 


7.588 


1.00 


58.47 
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HETATM 4164 


O 


HOH 


132 


HETATM 4165 


O 


HOH 


133 


HETATM 4166 


O 


HOH 


134 


HETATM 4167 


O 


HOH 


135 


HETATM 4168 


O 


HOH 


136 


HETATM 4169 


0 


HOH 


137 


HETATM 4170 


O 


HOH 


138 


HETATM 4171 


O 


HOH 


139 


HETATM 4172 


O 


HOH 


140 


HETATM 4173 


O 


HOH 


141 


HETATM 4174 


O 


HOH 


142 


HETATM 4175 


O 


HOH 


143 


HETATM 4176 


O 


HOH 


144 


HETATM 4177 


O 


HOH 


145 


HETATM 4178 


O 


HOH 


146 


HETATM 4179 


O 


HOH 


147 



END 



10.071 


-0.571 


-20.393 


1.00 


38.79 


9.562 


8.334 


-21.392 


1.00 


36.80 


6.712 


6.058 


8.822 


1.00 


37.43 


5.927 


8.454 


10.594 


1.00 


42.34 


4.472 


6.306 


10.973 


1.00 


37.35 


6.792 


7.721 


7.051 


1.00 


47.23 


24.513 


11.582 


33.724 


1.00 


45.55 


-2.528 


-20.361 


12.354 


1.00 


52.13 


-7.864 


7.706 


19.248 


1.00 


47.82 


11.577 


-16.962 


24.398 


1.00 


39.43 


18.087 


12.263 


-5.507 


1.00 


33.36 


-6.816 


-14.190 


10.674 


1.00 


51.32 


-7.377 


-16.701 


33.528 


1.00 


57.11 


-5.379 


-20.107 


32.689 


1.00 


43.01 


8.766 


-7.947 


-16.274 


1.00 


49.96 


10.946 


-7.937 


-18.142 


1.00 


55.67 
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Appendix 2 

Atomic Coordinates for Human ERa Complexed With OHT 



10 



15 



20 



25 



30 



35 



40 



45 



CRYST1 58.242 58.242 277.467 90.00 



ORIGX1 
ORIGX2 
ORIGX3 
SCALE 1 
SCALE2 
SCALE3 

ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 
ATOM 
ATOM 
ATOM 



1 .000000 
0.000000 
0.000000 
0.017170 
0.000000 
0.000000 



0.000000 
1.000000 
0.000000 
0.009913 
0.019826 
0.000000 



0.000000 
0.000000 
1 .000000 
0.000000 
0.000000 
0.003604 



1 

1 


CB 


LEU 


306 


6.638 




C 


LEU 


306 


7.381 


-> 


o 


LEU 


306 


6.407 


4 


N 


LEU 


306 


6.369 




CA 


LEU 


306 


7.232 


O 


N 


ALA 


307 


8.609 


-7 

/ 


CA 


ALA 


307 


8.891 


O 

o 


CB 


ALA 


307 


10.318 


Q 


c 


ALA 


307 


8.692 




U 


AT A 

ALA 


307 


8.451 


1 1 


IN 


LEU 


308 


8.789 




f A 


LLU 


308 


8.638 


i J 


PD 


T CT T 


308 


9.298 


14 

I *T 




T CT T 

Lh,U 


308 


10.637 


] 5 


("Til 


T t?T T 


'i AO 

308 


10.474 


16 


CD2 


T FT T 


jUo 


I 1 .OV4 


17 


C 


LEU 


308 


7.190 


18 


O 


LEU 


308 


6.935 


19 


N 


SER 


309 


6.246 


20 


CA 


SER 


309 


4.828 


21 


CB 


SER 


309 


4.034 


22 


OG 


SER 


309 


4.071 


23 


C 


SER 


309 


4.261 


24 


O 


SER 


309 


3.166 


25 


N 


LEU 


310 


5.016 


26 


CA 


LEU 


310 


4.591 


27 


CB 


LEU 


310 


5.651 


28 


CG 


LEU 


310 


5.586 


29 


CD1 


LEU 


310 


5.530 


30 


CD2 


LEU 


310 


6.809 


31 


C 


LEU 


310 


4.330 


32 


O 


LEU 


310 


4.993 


-» 


N 


THR 


311 


3:352 


34 


CA 


THR 


311 


3.017 


35 


CB 


THR 


311 


1.527 



90.00 120.00 P 65 2 2 12 



0.00000 
0.00000 
0.00000 
0.00000 
0.00000 
0.00000 



11.502 
10.684 
11.020 
9.128 
10.330 
10.605 
10.912 
10.501 
12.393 
12.770 
13.228 
14.668 
15.402 
14.822 
14.189 
15.920 
15.130 
16.307 
14.208 
14.544 
13.896 
12.479 
14.095 
14.507 
13.257 
12.749 
11.811 
10.333 
10.200 
9.610 
13.865 
14.905 
13.641 
14.604 
14.554 



3.989 
6.231 
6.905 
4.588 
4.754 
6.730 
8.125 
8.465 
8.429 
9.574 
7.400 
7.573 
6.406 
5.948 
4.569 
5.933 
7.710 
7.961 
7.546 
7.657 
6.514 
6.588 
9.003 
9.398 
9.706 
11.004 
11.582 
11.189 
9.676 
11.739 
12.003 
11.984 
12.874 
13.912 
14.275 



1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1 .00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 



61.20 

61.47 

62.09 

62.32 

61.30 

60.52 

58.77 

59.70 

57.51 

57.64 

55.82 

56.62 

57.48 

59.17 

60.38 

58.46 

56.51 

55.58 

57.04 

56.46 

56.79 

57.23 

56.13 

55.17 

54.31 

53.55 

54.40 

56.49 

57.06 

57.28 

53.18 

53.17 

51.71 

49.93 

48.96 
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ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



OG1 THR 311 



37 

38 

39 

40 

41 

42 

43 

44 

45 

46 

47 

48 

49 

50 

51 

52 

53 

54 

55 

56 

57 

58 

59 

60 

61 

62 

63 

64 

65 

66 

67 

68 

69 

70 

71 

72 

73 

74 

75 

76 

77 

78 

79 

80 

81 

82 

83 



CG2 
C 
O 
N 

CA 

CB 

C 

O 

N 

CA 

CB 

CG 

OD1 

OD2 

C 

O 

N 

CA 

CB 

CG 

CD 

OE1 

NE2 

C 

O 

N 

CA 

CB 

CG 

SD 

CE 

C 

O 

N 

CA 

CB 

CGI 

CG2 

C 

O 

N 

CA 

CB 

OG 

C 

O 

N 



THR 
THR 
THR 
ALA 
ALA 
ALA 
ALA 
ALA 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
GLN 
GLN 
GLN 
GLN 
GLN 
GLN 
GLN 
GLN 
GLN 
MET 
MET 
MET 
MET 
MET 
MET 
MET 
MET 
VAL 
VAL 
VAL 
VAL 
VAL 
VAL 
VAL 
SER 
SER 
SER 
SER 
SER 
SER 
ALA 



311 
311 
311 
312 
312 
312 
312 
312 
313 
313 
313 
313 
313 
313 
313 
313 
314 
314 
314 
314 
314 
314 
314 
314 
314 
315 
315 
315 
315 
315 
315 
315 
315 
316 
316 
316 
316 
316 
316 
316 
317 
317 
317 
317 
317 
317 
318 



1.242 
0.666 
3.815 
4.371 
3.857 
4.590 
4.359 
4.171 
5.009 
2.868 
2.367 
0.848 
0.430 
1.314 
-0.785 
2.745 
2.959 
2.826 
3.182 
2.849 
1.534 
0.982 
1.649 
-0.248 
4.673 
5.100 
5.459 
6.901 
7.565 
9.082 
9.906 
9.547 
7.218 
8.002 
6.599 
6.819 
6.023 
6.245 
6.446 
6.404 
7.141 
5.215 
4.733 
3.311 
3.225 
5.696 
6.011 
6.182 



13.311 
14.688 
14.201 
13.103 
15.078 
14.798 
15.910 
13.460 
12.609 
13.275 
12.032 
12.100 
12.872 
13.234 
13.117 
10.846 
9.741 
11.081 
10.028 
10.464 
9.886 
10.646 
11.515 
10.318 
9.722 
8.580 
10.757 
10.606 
11.985 
11.939 
11.190 
12.408 
9.791 
8.841 
10.165 
9.476 
10.136 
9.373 
11.583 
8.012 
7.117 
7.767 
6.400 
6.402 
5.631 
5.601 



14.930 

13.027 

15.145 

15.197 

16.141 

17.369 

18.378 

17.964 

18.262 

18.143 

18.714 

18.879 

20.118 

20.929 

20.282 

17.835 

18.330 

16.531 

15.588 

14.156 

13.626 

12.428 

11.856 

12.043 

15.707 

15.555 

15.980 

16.130 

16.224 

16.356 

14.925 

13.680 

17.379 

17.335 

18.491 

19.756 

20.897 

22.192 

21.059 

19.664 

20.077 

19.127 

18.997 

18.415 

17.230 

18.114 



1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 ' 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1. 00 



47.20 

50.99 

48.84 

46.66 

48.76 

47.75 

47.06 

47.41 

45.52 

47.58 

47.63 

51.96 

56.21 

56.38 

59.15 

43.93 

44.77 

44.52 

44.73 

45.05 

48.47 

50.37 

49.38 

51.74 

43.26 

43.93 

42.29 

41.26 

42.43 

42.34 

46.22 

37.32 

38.89 

40.02 

37.65 

39.56 

39.22 

44.43 

41.04 

40.04 

37.86 

41.90 

41.68 

43.85 

49.38 

39.72 



4.446 18.407 1.00 40.21 
6;220 17.043 1.00 38.35 



138 



9950658A2 I > 



WO 99/50658 



PCT/US99/06937 



10 



15 



20 



25 



30 
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45 



ATOM 


84 


CA 


ALA 3 1 8 


ATOM 


85 


CB 


ALA 318 


ATOM 


86 


C 


ALA 318 


ATOM 


87 


O 


ALA 


> 318 


ATOM 


88 


N 


LEU 


319 


ATOM 


89 


CA 


LEU 


319 


ATOM 


90 


CB 


LEU 


319 


ATOM 


91 


CG 


LEU 


319 


ATOM 


92 


CD1 


LEU 


319 


ATOM 


93 


CD2 


LEU 


319 


ATOM 


94 


C 


LEU 


319 


ATOM 


95 


O 


LEU 


319 


ATOM 


96 


N 


LEU 




ATOM 


97 


CA 


LEU 




ATOM 


98 


CB 


LFI I 


**9fi 


ATOM 


99 


CG 


LFI I 




ATOM 


100 


CD1 


I FT I 




ATOM 


101 


CD2 


r ft i 




ATOM 


102 


C 


T FT T 




ATOM 


103 


o 


LETJ 


^9fi 


ATOM 


104 


N 


ASP 


JZ 1 


ATOM 


105 


CA 


ASP 




ATOM 


106 


CB 


ASP 


19 1 


ATOM 


107 


CG 


ASP 


19 1 


ATOM 


108 


OD1 


ASP 




ATOM 


109 


OD2 


ASP 


171 


ATOM 


110 


C 


ASP 


321 


ATOM 


111 


O 


ASP 




ATOM 


112 


N 


ALA 


322 


ATOM 


113 


CA 


ALA 


322 


ATOM 


114 


CB 


ALA 


329 


ATOM 


115 


C 


ALA 


322 


ATOM 


116 


O 


ALA 


322 


ATOM 


117 


N 


GLU 


323 


ATOM 


118 


CA 


GLU 


323 


ATOM 


119 


CB 


GLU 


323 


ATOM 


120 


CG 


GLU 


323 


ATOM 


121 


CD 


GLU 


323 


ATOM 


122 


OE1 


GLU 


323 


ATOM 


123 


OE2 


GLU 


323 


ATOM 


124 


C 


GLU 


323 


ATOM 


125 


O 


GLU 


323 


ATOM 


126 


N 


PRO 


324 


ATOM 


127 


CD 


PRO 


324 


ATOM 


128 


CA 


PRO 


324 


ATOM 


129 


CB 


PRO 


324 


ATOM 


130 


CG 


PRO 


324 


ATOM 


131 


C 


PRO 


324 



7.1 14 
7.485 
8.375 
8.820 
8.938 
10.161 
10.660 
11.136 
11.714 
12.182 
9.965 
10.779 
8.879 
8.567 
7.239 
7.236 
5.876 
8.334 
8.466 
8.971 
7.812 
7.613 
6.669 
5.206 
4.901 
4.357 
8.911 
9.030 
9.878 
11.153 
11.772 
12.148 
13.219 
11.799 
12.704 
12.042 
12.209 
13.657 
14.313 
14.134 
13.205 
12.425 
14.527 
15.522 
15.158 
16.633 
16.81 1 
14.940 
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5.540 
6.448 
5.137 
3.992 
6.089 
5.854 
7.174 
8.264 
9.440 
7.693 
4.826 
3.916 
4.982 
4.067 
4.467 
5.582 
• 5.634 
5.332 
2.642 
1.697 
2.504 
1.210 
1.372 
1.318 
1.422 
1.172 
0.565 
-0.661 
1.395 
0.905 
1.954 
0.513 
-0.020 
0.768 
0.460 
0.768 
2.210 
2.569 
3.173 
2.245 
-0.978 
-1.931 
-1.151 
-0.069 
-2.474 
-2.166 
-0.807 
-3.162 



16.153 

14.986 

16.920 

16.844 

17.664 

18.438 

19.040 

18.071 

18.857 

17.140 

19.549 

19.729 

20.297 

21.387 

22.049 

23.099 

23.802 

24.112 

20.843 

21.443 

19.696 

19.053 

17.860 

18.255 

19.464 

17.346 

18.568 

18.533 

18.193 

17.686 

16.776 

18.769 

18.473 

20.022 

21.117 

22.459 

22.899 

23.200 

22.326 

24.309 

21.110 

20.999 

21.225 

21.345 

21.240 

21.003 

21.610 

22.583 



1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 
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27.653 3.979 14.866 1.00 ^5 95 

28.402 4.236 12.762 1.00 ^586 
29.786 4.054 13.173 1.00 ^7 58 
30.730 4.036 11.968 1.00 30 36 
32.172 3.785 12.380 1.00 37 98 
33.080 3.471 11.210 1.00 45 r? 
32.869 4.048 10.120 1.00 42 99 
34.004 2.646 11.386 1.00 45 79 
30.218 5.159 14.133 1.00 ?7 50 
31.056 4.937 15.010 1.00 ^6 67 
29.637 6.339 13.965 0.75 24 89 
29.645 6.352 13.980 0.25 25 79 
29.969 7.466 14.826 0.75 ?4I2 
29.993 7.481 14.847 0.25 ?486 
29.621 8.781 14.122 0.75 25 96 
29.766 8.814 14.115 0.25 25 62 
30.698 9.192 12.732 0.75 31 63 
30.227 10.312 15.059 0.25 ?5 40 
29.237 7.422 16.162 0.75 22 07 
29.211 7.498 16.159 0.25 ?3 97 
29.812 7.730 17.206 0.75 21 97 
29.724 7.940 17.187 0.25 ?3 99 
27.974 7.012 16.128 1.00 ^3 41 
27.140 7.015 17.318 1.00 22 83 
25.785 7.587 16.948 1.00 ">5 50 
26.913 5.755 18.131 1.00 7539 
26.374 5.837 19.234 1.00 23 09 
27.311 4.602 17.615 1.00 ^5 98 
27.026 3.354 18.318 1.00 ?3 80 
27.669 2.172 17.580 1.00 ^2 52 
29.130 2.054 17.762 1.00 ?4 42 
29.797 1.347 18.803 1.00 27 31 
31.182 1.484 18.579 1.00 ?8 24 
29:360 0.609 19.912 1.00 7737 
30.102 2.578 16.965 1. 00 ^458 
31.342 2.239 17.446 1.00 ">7 35 
32.133 0.909 19.420 1.00 ?8 76 
30.305 0.039 20.745 1.00 28 09 
31.674 0.191 20.496 1.00 29 77 
27.356 3.309 19.802 1.00 23.54 
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CLAIMS 

WHAT IS CLAIMED IS: 

1 . A method of identifying a compound that modulates nuclear receptor activity, said method 
comprising: 

5 modeling test compounds that fit spatially into a nuclear receptor ligand binding domain of 

interest using an atomic structural model of the estrogen receptor a ligand binding domain 
or portion thereof, 

screening said test compounds in an assay characterized by binding of a test compound to 
the ligand binding domain, and 
10 identifying a test compound that modulates nuclear receptor activity, wherein said atomic 

structural model comprises atomic coordinates of amino acid residues corresponding to 
residues of human estrogen receptor a Met343, Leu346, Ala350, Glu353, Leu384, Leu387. 
Leu39L Arg394, Phe404, Met421, Leu428, Gly52L His524, Leu525 and Met528. 

2. The method of Claim 1 wherein the amino acid residues correspond to residues Met343, 
1 5 Met42 1 , His524, Leu525 and Met528. 

3. The method of Claim 1 wherein the test compound is an agonist and nuclear receptor 
activity is measured by binding of a coactivator to the coactivator binding site. 

4. The method of Claim 1 wherein the test compound is an antagonist and nuclear receptor 
activity is measured by the unwinding of helix 1 2. 

20 5. The method of Claim 1 wherein the test compound is an antagonist and nuclear receptor 
activity is measured by the blocking of coactivator binding. 

6. The method of Claim 1 wherein said screening is in vitro. 

7. The method of Claim 6 wherein said screening is high throughput screening. 

8. The method of Claim 1 wherein said test compound is from a library of compounds. 

25 9. The method of Claim 1 wherein said test compound is a small organic molecule, a peptide, 
or peptidomimetic. 

10. The method of Claim 1 which further comprises the step of providing the atomic 
coordinates of the estrogen receptor a ligand binding domain or portion thereof to a 
computerized modeling system, prior to said modeling step. 

The method of Claim 1 wherein said nuclear receptor is selected from the croup consisting 
of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid receptors. 
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progestin receptors, mineralocorticoid receptors, androgen receptors, peroxisome receptors 
and vitamin D receptors. 

2. The method of Claim 1 1 wherein said nuclear receptor is an estrogen receptor. 

3. The method of Claim 1 2 wherein said estrogen receptor is the estrogen receptor a. 

4. A method of identifying a compound that modulates ligand binding to a nuclear receptor, 
said method comprising: 

modeling test compounds that fit spacially into a nuclear receptor ligand binding domain of 
interest using an atomic structural model of the estrogen receptor a ligand binding domain 
or portion thereof, 

screening said test compounds in an assay characterized by binding of a test compound to 
the binding domain, and 

identifying a test compound that modulates ligand binding to said nuclear receptor, 
wherein said atomic structural model comprises atomic coordinates of amino acid residues 
corresponding to residues of human estrogen receptor a Met343, Leu346, Ala350, Glu353, 
Leu384, Leu387, Leu39K Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525 and 
Met528. 

15. The method of Claim 14 wherein the amino acid residues correspond to residues Met343, 
Met421, His524, Leu525 and Met528. 

16. The method of Claim 14 wherein said nuclear receptor is selected from the group 
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid 
receptors, progestin receptors, mineralocorticoid receptors, androgen receptors, 
peroxisome receptors and vitamin D receptors, 

1 7. The method of Claim 1 6 wherein said nuclear receptor is an estrogen receptor. 

1 8. The method of Claim 1 7 wherein said estrogen receptor is the estrogen receptor a. 

19. The method of Claim 14 wherein said screening is in vitro. 

20. The method of Claim 19 wherein said screening is high throughput screening. 

21. The method of Claim 14 wherein said test compound is from a library of compounds. 

22. The method of Claim 14 wherein said test compound is an agonist or antagonist of ligand 
binding. 

23. The method of Claim 14 wherein said test compound is a small organic molecule, a 
peptide, or peptidomimetic. 
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24. A method for identifying an agonist or antagonist of ligand binding to a nuclear receptor, 
said method comprising the steps of: 

providing the atomic coordinates of the estrogen receptor a ligand binding domain or 
portion thereof to a computerized modeling system, wherein said atomic coordinates are of 
5 the amino acid residues corresponding to residues of human estrogen receptor a Met343. 

Leu346, Ala350, Glu353, Leu384, Leu387. Leu391, Arg394, Phe404, Met421. Leu428, 
Gly52 1 , His524, Leu525 and Met528; 

modeling compounds which fit spacially into the ligand binding domain; and 
identifying in an assay for nuclear receptor activity a compound which increases or 
10 decreases the activity of the nuclear receptor by binding the ligand binding domain of said 

nuclear receptor, whereby an agonist or antagonist of ligand binding is identified. 

25. The method of Claim 24 wherein the amino acid residues correspond to residues Met343, 
Met42L His524, Leu525 and Met528. 

26. The method of Claim 24 wherein said nuclear receptor is selected from the group 

15 consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid 

receptors, progestin receptors, mineralocorticoid receptors, androgen receptors, 
peroxisome receptors and vitamin D receptors. 

27. The method of Claim 26 wherein said nuclear receptor is an estrogen receptor. 

28. The method of Claim 27 wherein said estrogen receptor is the estrogen receptor a. 
A method of modulating nuclear receptor activity in a mammal by administering to a 
mammal in need thereof a sufficient amount of a compound that fits spatially and 
preferentially into a ligand binding domain of a nuclear receptor of interest, where said 
compound is designed so as to distort at least one amino acid residue corresponding to 
residues of human estrogen receptor a Met343, Leu346, Ala350, Glu353, Leu384, Leu387, 

25 Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525 and Met528. 

30. The method of Claim 29 wherein at least one amino acid residue corresponds to residues 
Met343. Met421, His524, Leu525 and Met528. 

3 1 . The method of Claim 29 wherein said nuclear receptor is selected from the group 
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid 

30 receptors, progestin receptors, mineralocorticoid receptors, androgen receptors, 

peroxisome receptors and vitamin D receptors. 

The method of Claim 3 1 wherein said nuclear receptor is an estrogen receptor. 
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33. The method of Claim 32 wherein said estrogen receptor is the estrogen receptor a. 

34. A machine-readable data storage medium, comprising a data storage material encoded with 
machine readable data which, when using a machine programmed with instructions for 
using said data* is capable of displaying a graphical three-dimensional representation of a 

5 molecular complex of a compound bound to a nuclear receptor ligand binding domain 

comprising structure coordinates of amino acids corresponding to human estrogen receptor 
a Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu39L Arg394, Phe404, Met421, 
Leu428, Gly521, His524, Leu525 and Met528 or a homologue of said molecular complex, 
wherein said homologue comprises a ligand binding domain that has a root mean square 
1 0 deviation from the backbone atoms of said amino acids of not more than 1 .5A. 

35. The machine-readable data storage medium of Claim 34 wherein the amino acid residues 
correspond to residues Met343, Met421, His524, Leu525 and Met528. 

36. The machine-readable data storage medium of Claim 34 wherein said nuclear receptor is 
selected from the group consisting of estrogen receptors, thyroid receptors, retinoid 

15 receptors, glucocorticoid receptors, progestin receptors, mineralocorticoid receptors, 

androgen receptors, peroxisome receptors and vitamin D receptors. 

37. The machine-readable data storage medium of Claim 36 wherein said nuclear receptor is 
an estrogen receptor. 

38. The machine-readable data storage medium of Claim 37 wherein said estrogen receptor is 
20 the estrogen receptor a. 

39. The machine-readable data storage medium of Claim 34 wherein said molecular complex 
is defined by the set of structure coordinates depicted in Appendix 1 or Appendix 2, or a 
homologue of said molecular complex, said homologue having a root mean square 
deviation from the backbone atoms of said amino acids of not more than 1.5 A. 

25 40. A machine-readable data storage medium comprising a data storage material encoded with 
a first set of machine readable data which, when combined with a second set of machine 
readable data, using a machine programmed with instructions for using said first set of data 
and said second set of data, can determine at least a portion of the structure coordinates 
corresponding to the second set of machine readable data, wherein: said first set of data 

30 comprises a Fourier transform of at least a portion of the structural coordinates selected 

from the group consisting of coordinates depicted in Appendix 1 or Appendix 2; and said 
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second set of data comprises an X-ray diffraction pattern of a molecule or molecular 
complex. 

41. The machine-readable data storage medium of Claim 40 wherein said nuclear receptor is 
selected from the group consisting of estrogen receptors, thyroid receptors, retinoid 
receptors, glucocorticoid receptors, progestin receptors, mineralocorticoid receptors, 
androgen receptors, peroxisome receptors and vitamin D receptors. 

42. The machine-readable data storage medium of Claim 41 wherein said nuclear receptor is 
an estrogen receptor. 

43. The machine-readable data storage medium of Claim 42 wherein said estrogen receptor is 
the estrogen receptor a. 

44. A cocrystal of a nuclear receptor comprising an agonist bound to the ligand binding 
domain and a molecule bound to the coactivator binding site of the nuclear receptor, 
wherein said crystal defracts with at least 2.03A resolution. 

45. The cocrystal of Claim 44 wherein said nuclear receptor is the estrogen receptor a. 

46. The cocrystal of Claim 45 wherein said estrogen receptor a is human. 

47. The cocrystal of Claim 46 wherein said molecule is peptide. 

48. The cocrystal of Claim 47 wherein said peptide comprises a NR-box amino acid sequence 
or derivative thereof. 

49. A cocrystal of a nuclear receptor comprising an antagonist bound to the ligand binding 
domain of the nuclear receptor, wherein said crystal defracts with at least 1.9A resolution. 

50. The cocrystal of Claim 49 wherein said nuclear receptor is the estrogen receptor a. 

5 1 . The cocrystal of Claim 50 wherein said estrogen receptor a is human. 

52. A computational method of designing a nuclear receptor ligand where at least one amino 
acid residue of a nuclear receptor LBD that corresponds to human estrogen receptor a 
Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394. Phe404, Met421, 
Leu428, Gly521, His524, Leu525 and Met528, interacts with at least one first chemical 
moiety of said ligand. comprising the step of selecting at least one chemical modification 
of said first chemical moiety to produce a second chemical moiety with a structure to either 
decrease or increase an interaction between said interacting amino acid and said second 
chemical moiety compared to said interaction between said interacting amino acid and said 
first chemical moiety. 
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57. 



53. The method of Claim 52 wherein at least one amino acid residue corresponds to residues 
Met343. Met42 1 . His524, Leu525 and Met528. 

The method of Claim 52 further comprising determining a change in interaction between 
said interacting amino acid and said ligand after chemical modification of said first 
chemical moiety. 

The method of Claim 52 wherein said chemical modification enhances hydrogen bonding 
interaction, charge interaction, hydrophobic interaction. Van Der Waals interaction or 
dipole interaction between said second chemical moiety and said interacting amino acid 
compared to said first chemical moiety and said interacting amino acid. 
The method of Claim 52 wherein said chemical modification reduces hydrogen bonding 
interaction, charge interaction, hydrophobic interaction. Van Der Waals interaction or 
dipole interaction between said second chemical moiety and said interacting amino acid 
compared to said first chemical moiety and said interacting amino acid. 
The method of Claim 52 wherein said nuclear receptor is selected from the group 
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid 
receptors, progestin receptors, mineralocorticoid receptors, androgen receptors, 
peroxisome receptors and vitamin D receptors. 

58. The method of Claim 57 wherein said nuclear receptor is an estrogen receptor. 

59. The method of Claim 52 wherein the estrogen receptor is the estrogen receptor a. 
20 60. The method of Claim 59 wherein the ligand is an agonist. 

61. The method of Claim 60 wherein the ligand is selected from the group consisting of 1 70- 
estradiol, diethylstilbestrol, moxestrol, mesohexestrol, coumestrol, A 9 -THC. o,p-DDT, 
zearalenone and kepone. 

The method of Claim 61 wherein the ligand is 1 7P-estradiol, and the first chemical moiety 
is a free carbon of the A' ring located at a position selected from the group consisting of 
C6a, C7a. CI 2a, CI 5a, CI 6a and CI 7a. 
63. The method of Claim 59 wherein the ligand is an antagonist. 

The method of Claim 63 wherein the ligand is selected from the group consisting of ICI 
1 64,384 and EM800. 

The method of Claim 59 wherein the ligand is a selective estrogen receptor modulator. 
The method of Claim 65 wherein the ligand is selected from the group consisting of 
tamoxifen, raloxifene and GW5638. 
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67. A method of modulating nuclear receptor activity in a mammal by administering to a 

mammal in need thereof a sufficient amount of a ligand that fits spatially and preferentially 
into a ligand binding domain of a nuclear receptor of interest, wherein said ligand is 
designed by a computational method where at least one amino acid residue of a nuclear 
receptor ligand binding domain that corresponds to human estrogen receptor a Met343, 
Leu346, Ala350, Glu353. Leu384. Leu387, Leu39L Arg394. Phe404. Met421. Leu428. 
Gly521. His524, Leu525 and Met528, interacts with at least one first chemical moiety of 
said ligand, comprising the step of selecting at least one chemical modification of said first 
chemical moiety to produce a second chemical moiety with a structure to either decrease or 
increase an interaction between said interacting amino acid and said second chemical 
moiety compared to said interaction between said interacting amino acid and said first 
chemical moiety. 

The method of Claim 67 wherein at least one amino acid residue corresponds to residues 
Met343, Met421, His524, Leu525 and Met528. 
The method of Claim 67 wherein said ligand is an antagonist. 

70. The method of Claim 67 wherein said ligand is an agonist. 

7 1 . The method of Claim 70 which further comprises administering a coactivator mimic 
designed by a computational method where at least one amino acid residue of a nuclear 
receptor coactivator binding site that corresponds to human estrogen receptor a helix 3 

20 residues Leu354, Val355, Met357, Ile358, Ala36 1 and Lys362. helix 4 residue Phe367, 

helix 5 residues Gln375, Val376, Leu379 and Glu380, helix 6 residue Trp383. and helix 12 
residues Asp538, Leu539. Glu542, Met543 and Leu544, interacts with at least one first 
chemical moiety of said coactivator mimic, comprising the step of selecting at least one 
chemical modification of said first chemical moiety to produce a second chemical moiety 
with a structure to either decrease or increase an interaction between said interacting 
amino acid and said second chemical moiety compared to said interaction between said 
interacting amino acid and said first chemical moiety. 
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<210> 1 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> BINDING 
<222> (1) . . (5) 

<223> residues 2 and 3 can be any amino acid 
<400> 1 

Leu Xaa Xaa Leu Leu 
1 5 



<210> 2 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> BINDING 
<222> (1) . . (5) 

<223> residues 2 and 3 can be any amino acid 
<400> 2 

Leu Xaa Xaa Met Leu 
1 5 
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<210> 3 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Leu Leu Gin Met Leu 
1 5 



<210> 4 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Lys His Lys lie Leu His Arg Leu Leu Gin Asp Ser Ser 
1 5 10 



<210> 5 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Thr Pro Ala lie Thr Arg Val Val Asp Phe Ala Lys Lys Leu Pro Met 
1 5 10 15 

Phe Cys Glu Leu Pro Cys Glu Asp Gin lie lie Leu Leu Lys Gly Cys 
20 25 30 

Cys 



<210> S 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> € 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 
15 10 



<210> 7 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Thr Pro Ala lie Thr Arg Val Val Asp Phe Ala Lys Lys Leu Pro Met 
15 10 15 

Phe Ser Glu Leu Pro Cys Glu Asp Gin lie lie Leu Leu Lys Cys Cys 
20 25 30 
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Cys 



<210> 8 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 
1 5 10 



<210> 9 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Thr Lys Cys He He Lys He Val 
1 5 

Phe Thr Gly Leu Ser He Ala Asp 
20 

Cys 



Glu Phe Ala Lys Arg Leu Pro Gly 
10 is 

Gin He Thr Leu Leu Lys Ala Ala 
25 30 



<210> 10 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Pro Met Pro Pro Leu He Arg Glu Met Leu Glu Asn 
1 5 io 



<210> 11 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Asp Lys Gin Leu Phe Thr Leu Val 
1 5 

Phe Ser Glu Leu Pro Leu Asp Asp 
20 

Trp 



Glu Trp Ala Lys Arg He Pro His 
10 is 

Gin Val He Leu Leu Arg Ala Gly 
25 30 
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<212> PRT 

<213> Homo sapiens 
<400> 12 

Pro lie Asp Thr Phe Leu 
1 5 



Met Glu Met Leu Glu Ala 
10 



<210> 13 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 13 

Val Glu Ala Val Gin Glu lie Thr 
1 5 

Phe lie Asn Leu Asp Leu Asn Asp 
20 

Val 



Glu Tyr Ala Lys Asn lie Pro Gly 

10 15 

Gin Val -Thr Leu Leu Lys. Tyr Gly 

25 30 



<210> 14 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys Asp 
15 10 



<210> 15 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 15 

Ser Tyr Ser lie Gin Lys Val He 
1 5 

Phe Arg Asp Leu Thr Ser Glu Asp 
20 

Ala 



Gly Phe Ala Lys Met He Pro Gly 
10 15 

Gin He Val Leu Leu Lys Ser Ser 
25 30 



<210> 16 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Lys Leu Thr Pro Leu Val Leu Glu Val Phe Gly Asn 
15 10 
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<210> 17 
<211> 33 
<212> PRT 

<213> Homo sapiens 



<400> 17 

Asp Arg Glu Leu Val His Met He Asn Trp Ala Lys Arg Val Pro Gly 
15 10 15 

Phe Val Asp Leu Thr Leu His Asp Gin Val His Leu Leu Glu Cys Ala 



Trp 



<210> 18 
<211> 12 
<212> PRT 

<213> Homo sapiens 



<400> 18 

Pro Leu Tyr Asp Leu Leu Leu Glu Met Leu Asp Ala 
15 10 



<210> 19 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Gly Arg Gin Val He Ala Ala Val Lys Trp Ala Lys Ala He Pro Gly 
15 10 15 

Phe Arg Asn Leu His Leu Asp Asp Gin Met Thr Leu Leu Gin Tyr Ser 
20 25 30 



Trp 



<210> 20 
<211> 12 
<212> PRT 

<213> Homo sapiens 



<400> 20 

Glu Phe Pro Glu Met Leu Ala Glu He He Thr Asn 
15 10 



<210> 21 
<211> 33 
<212> PRT 

<213> Homo sapiens 
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Glu Arg Gin Leu Leu Ser Val val Lys Trp Ser Lys Ser Leu Pro Gly 



« 15 



Phe Arg Asn Leu His He Asp Asp Gin He Thr Leu He Gin Tyr Ser 
20 25 30 y 

Trp 



<210> 22 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Glu Phe Pro Glu Met Met Ser Glu Val lie Ala Ala 

<210> 23 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Gly Lys Gin Met He Gin Val Val Lys Trp Ala Lys Val Leu Pro Gly 

5 10 15 

Phe Lys Asn Leu Pro Leu Glu Asp Gin He Thr Leu He Gin Tyr Ser 
20 25 30 

Trp 



<210> 24 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 24 

Glu Phe Pro Ala Met Leu Val Glu He He Ser Asp 



10 



<210> 25 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 25 



Glu Arg Gin Leu Val His Val Val Lys Trp Ala Lys Ala Leu Pro Gly 

5 in 1 



io 15 



Phe Arg Asn Leu His Val Asp Asp Gin Met Ala Val He Gin Tyr Ser 

25 .30 
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Trp 



<210> 26 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 26 

Asp Phe Pro Glu Met Met Ala Glu He He Ser Val 
1 5 10 
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